DOCKET NO.; 19603/10303 (CRF D-1043A) 
EXPRESS MAIL NO.: EL 434571921US 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

UTILITY PATENT APPLICATION TRANSMITTAL FORM 
(only for new nonprovisional applications under 37 CFR 1.53(b) 

ASSISTANT COMMISSIONER FOR PATENTS 
Washington, D.C. 20231 
BOX: PATENT APPLICATION 

SIR: 

Transmitted herewith for filing is the patent application (including Cover Pages and PCT Publication 
documents, Specification, Claims, and Abstract (84 pages) of: 

Inventors : Dennis Gonsalves and Sheng-Zhi Pang 

For : TOSPOVIRUS RESISTANCE IN PLANTS 

**If a CONTINUING APPLICATION please mark where appropriate and supply the requisite 
information below and in a preliminary amendment: 

[X] continuation [ ] divisional [ ] Continuation-in-part (CIP) 
of prior application Serial No, 08/495,484 

Prior application information: Examiner : E. McElwain 

Art Unit : 1649 

Enclosed are: 

[X] 5 sheets of informal drawings. 

[ ] Signed Combined Declaration and Power of Attorney ( pages). 

[X] Copy of signed Combined Declaration and Power of Attorney (2 pages) from a prior application 
(1.63(d) (for continuation/divisional). 

[ ] Signed statement deleting inventor(s) named in prior application ( pages) (L63 (d)(2) and 

1.33(b)). 

[X] Incorporation By Reference: The entire disclosure of the prior application, from which a copy 
of the oath or declaration is supplied herewith, is considered as being part of the disclosure of the 
enclosed application and is hereby incorporated by reference therein. 

[ ] Assignment ( pages) of the invention to . 

[ ] Assignment Transmittal Letter. 

[ ] Certified copy of a foreign priority document. 

[ ] Associate power of attorney. 

[ ] Verified statement to establish small entity status ( pages) (newly signed or copy filed in 

prior application). 




-2- 



[X] Preliminary Amendment (11 pages). 

[X] Information Disclosure Statement, form PTO-1449 (5 pages) (in duplicate) and 39 references 
(not attached). 

[X] Sequence Listing (21 pages). 

[X] Statement in Accordance with 37 CFR §§ 1 .821(f) and 1 .825(b) and computer readable 3.5" 
Diskette. 

[X] A self-addressed, prepaid postcard acknowledging receipt. 
[ ] Other: 

The Filing fee has been calculated as shown below: 





(Col. 1) 


(Col. 2) 


SMALL ENTITY 




LARGE ENTITY 


FOR: 


NO. FILED 


NO, EXTRA 




RATE 


FEE 


OR 


RATE 


FEE 


BASIC FEE 


XXXXXXXX 


XXXXXXXX 




XXXX 


$380 


OR 


XXXX 


$760 


TOTAL CLAIMS 


15-20 = 


0 




x 9 = 


$ 


QR 


x 18 = 


$ 


INDEP CLAIMS 


5-3 = 


2 




x39= 


$ 


OR 


X78 = 


$156 


[ ] MULTIPLE DEPENDENT CLAIM PRESENTED 




xl30 = 


$ 


OR 


x260 = 


$ 


*If the Total Claims are less than 20 and Indep. Claims 
are less than 3, enter "0" in Col. 2 


TOTAL 


$ 


QR 


TOTAL 


$916 



[ ] Please charge my Deposit Account No. in the amount of $ . A duplicate 

copy of this sheet is enclosed. 

[X] A check in the amount of $916.00 to cover the filing fee is enclosed. 

[X] The Commissioner is hereby authorized to charge any additional fees which may be required, or 
credit any overpayment to Deposit Account No. 14-1138 . A duplicate copy of this sheet is 
enclosed. 

[X] Address all future communications to: 

Michael L. Goldman 
NIXON PEABODY LLP 
Clinton Square, P.O. Box 1051 
Rochester, New York 14603 



Date: &d>Lr 12, /fff 

Edwin V. Merkel 
Registration No. 40,087 

NIXON PEABODY LLP 
Clinton Square, P.O. Box 1051 
Rochester, New York 14603 
Telephone: (716) 263-1128 
Facsimile: (716)263-1600 




R344634.1 



PATENT 

Docket No,: 19603/10303 (CRF D-1043A) 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicants : Gonsalves et al. ) Examiner: 

) To Be Assigned 

Serial No. : Continuation of 08/495 ,484 ) 

) Art Unit: 

Filed : Herewith ) To Be Assigned 
For : TOSPOVIRUS RESISTANCE IN PLANTS ) 



PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington D.C. 20231 

Sir: 

Please amend the above-identified application as follows: 

In the Title : 

Please change the title to read -TOSPOVIRUS RESISTANCE IN PLANTS--. 
In the Specification : 

On page 1, immediately below the Title of the Invention, please insert -This 
application is a continuation of U.S. Patent Application Serial No. 08/495,484 filed 
September 25, 1 995, as a national stage application of PCT Application No. 
PCT/US94/01046 filed January 27, 1994, which claims the priority benefit of U.S. Patent 
Application Serial No. 08/010,410 filed January 29, 1993,-- 

On page 3, line 18, change "in vivo" to — in vivo—. 

On page 4, line 25 "change "viron" to -virion- (each occurrence). 

On page 4, line 27, change "viron" to -virion—. 

On page 5, line 13, change "MK 2 HP0 4 " to -M K 2 HP0 4 ~. 
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On page 5, lines 19-20, change "weight. The" to —weight, the--. 

On page 6, line 23, before "complementary", insert — (SEQ, ID. No. 1) --. 
On page 6, line 24, before "of, insert « (SEQ. ID. No. 2) 
On page 6, line 27, change "clones" to --clone--. 
On page 6, line 32, before "(also", insert - (SEQ. ID, No. 3) -. 
On page 7, line 9, before "(also", insert - (SEQ. ID. No. 4) 
On page 9, line 7, after the word "sequence" but before the please insert 
(SEQ. ID. No. 5)-. 

Please delete lines 8-35 on page 9 and lines 1-17 on page 10, and insert the 

following: 

-- CAAGTTGAAA GCAACAACAG AACTGTAAAT TCTCTTGCAG TGAAATCTCT GCTCATGTCA 6 0 

GCAGAAAACA ACATCATGCC TAACTCTCAA GCTTCCACTG ATTCTCATTT CAAGCTGAGC 12 0 

CTCTGGCTAA GGGTTCCAAA GGTTTTGAAG CAGGTTTCCA TTCAGAAATT GTTCAAGGTT 180 

GCAGGAGATG AAACAAACAA AACATTTTAT TTATCTATTG CCTGCATTCC AAACCATAAC 240 

AGT GTTGAGA CAGCTTTAAA CATTACTGTT ATTTGCAAGC ATCAGCTCCC AATTCGCAAA 3 00 

TGCAAAGCTC CTTTTGAATT AT CAATGATG TTTTCTGATT TAAAGGAGCC TTACAACATT 3 60 

GTTCATGACC CTTCATACCC CAAAGG AT CG GTTCCAATGC TCTGGCTCGA AACTCACACA 42 0 

TCTTTGCACA AGTTCTTTGC AACTAACTTG CAAGAAGATG TAATCATCTA CACTTTGAAC 48 0 

AACCTTGAGC TAACTCCTGG AAAGTTAGAT TTAGGTGAAA GAAC CTTGAA TTACAGTGAA 54 0 

GATGCCTACA AAAGGAAATA TTTCCTTTCA AAAACACTTG AATGTCTTCC ATCTAACACA 60 0 

CAAACTATGT CTTACTTAGA CAGCATCCAA ATCCCTTCAT GGAAGATAGA CTTTGCCAGA 660 

GGAGAAATTA AAATTTCTCC ACAATCTATT TCAGTTGCAA AATCTTTGTT AAAGCTTGAT 72 0 

TTAAGCGGGA TCAAAAAGAA AGAATCTAAG GTTAAGGAAG CGTATGCTTC AGGAT CAAAA 78 0 

TAATCTTGCT TTGTCCAGCT TTTTCTAATT ATGTTATGTT TATTTTCTTT CTTTACTTAT 84 0 

AATTATTTCT CTGTTTGTCA TCTCTTTCAA ATTCCTCCTG TCTAGTAGAA ACCATAAAAA 90 0 

CAAAAAATAA AAATGAAAAT AAAATTAAAA TAAAATAAAA TCAAAAAATG AAATAAAAAC 960 

AACAAAAAAT TAAAAAACGA AAAACCAAAA AGACCCGAAA GGGAC CAATT TGGCCAAATT 102 0 

TGGGTTTTGT TTTTGTTTTT TGTTTTTTGT TTTTTATTTT TTATTTTATT TTTATTTTAT 108 0 

TTTATTTTTA TTTTATTTTT ATTTTATTTA TTTTTTGTTT TCGTTGTTTT TGTTATTTTA 114 0 

TTATTTATTA AG CACAAC AC ACAGAAAGCA AACTTTAATT AAACACACTT AT TT AAAATT 12 0 0 

TAACACACTA AGCAAGCACA AG CAATAAAG ATAAAGAAAG CTTTATATAT TTATAGGCTT 12 60 
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TTTTATAATT TAACTTACAG CTGCTTTCAA GCAAGTTCTG CGAGTTTTGC CTGCTTTTTA 13 2 0 

ACCCCGAACA TTTCATAGAA CTTGTTAAGA GTTTCACTGT AATGTTCCAT AGCAACACTC 13 8 0 

CCTTTAGCAT TAGGATTGCT GGAGCTAAGT ATAGCAGCAT ACTCTTTCCC CTTCTTCACC 144 0 

TGATCTTCAT TCATTTCAAA TGCTTTGCTT TTCAGCACAG TGCAAACTTT TCCTAAGGCT 150 0 

TCCTTGGTGT CATACTTCTT TGGGTCGATC CCGAGGTCCT TGTATTTTGC AT C CTGATAT 1560 

ATAGCCAAGA CAACACTGAT CATCTCAAAG CTATCAACTG AAGCAATAAG AGGTAAGCTA 1620 

CCTCCCAGCA TTATGGCAAG TCTCACAGAC TTTGCATCAT CGAGAGGTAA TCCATAGGCT 16 80 

TGAAT CAAAG GATGGGAAGC AAT CTTAGAT TTGATAGTAT TGAGATTCTC AGAATTCCCA 174 0 

GTTTCTTCAA CAAGCCTGAC CCTGATCAAG CTATCAAGCC TTCTGAAGGT CATGTCAGTG 18 00 

GCTCCAATCC TGTCTGAAGT TTTCTTTATG GTAATTTTAC CAAAAGTAAA ATCGCTTTGC 1860 

TTAATAACCT TCATTATGCT CTGACGATTC TTTAGGAATG TCAGACATGA AATAACGCTC 1920 

AT CTTCTTGA TCTGGTCGAT GTTTTCCAGA CAAAAAGTCT TGAAGTTGAA TGCTACCAGA 198 0 

TTCTGATCTT CCTCAAACTC AAGGTCTTTG CCTTGTGTCA ACAAAG CAAC AATGCTTTCC 204 0 

TTAGTGAGCT TAACCTTAGA CATGATGATC GTAAAAGTTG TTATATGCTT TGACCGTATG 2100 

TAACTCAAGG TGCGAAAGTG CAACTCTGTA TCCCGCAGTC GTTTCTTAGG TTCTTAATGT 216 0 

GATGAT TTGT AAGACTGAGT GTTAAGGTAT GAACACAAAA TTGACACGAT TGCTCT 2216 

On page 10, line 21, after "783" but before the ": M , please insert -(SEQ. ID. 

No. 7)--. 

Please delete lines 22-39 on page 10 and lines 1-18 on page 1 1, and insert the 

following: 



Gin Val Glu Ser Asn Asn Arg Thr Val Asn Ser Leu Ala Val Lys Ser 
15 10 15 

Leu Leu Met Ser Ala Glu Asn Asn lie Met Pro Asn Ser Gin Ala Ser 
20 25 30 

Thr Asp Ser His Phe Lys Leu Ser Leu Trp Leu Arg Val Pro Lys Val 
35 40 45 

Leu Lys Gin Val Ser lie Gin Lys Leu Phe Lys Val Ala Gly Asp Glu 
50 55 60 

Thr Asn Lys Thr Phe Tyr Leu Ser lie Ala Cys lie Pro Asn His Asn 
65 70 75 80 

Ser Val Glu Thr Ala Leu Asn lie Thr Val lie Cys Lys His Gin Leu 
85 90 95 
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Pro lie Arg Lys 
100 

Asp Leu Lys Glu 
115 

Gly Ser Val Pro 
130 

Phe Phe Ala Thr 
145 

Asn Leu Glu Leu 



Asn Tyr Ser Glu 
180 

Leu Glu Cys Leu 
195 

lie Gin lie Pro 
210 

lie Ser Pro Gin 
225 

Leu Ser Gly lie 



Cys Lys Ala Pro 



Pro Tyr Asn lie 
120 

Met Leu Trp Leu 
135 

Asn Leu Gin Glu 
150 

Thr Pro Gly Lys 
165 

Asp Ala Tyr Lys 



Pro Ser Asn Thr 
200 

Ser Trp Lys lie 
215 

Ser lie Ser Val 
230 

Lys Lys Lys Glu 
245 



Phe Glu Leu Ser 
105 

Val His Asp Pro 



Glu Thr His Thr 
140 

Asp Val He He 
155 

Leu Asp Leu Gly 
170 

Arg Lys Tyr Phe 
185 

Gin Thr Met Ser 



Asp Phe Ala Arg 
220 

Ala Lys Ser Leu 
235 

Ser Lys Val Lys 
250 



Met Met Phe Ser 
110 

Ser Tyr Pro Lys 
125 

Ser Leu His Lys 



Tyr Thr Leu Asn 
160 

Glu Arg Thr Leu 
175 

Leu Ser Lys Thr 
190 

Tyr Leu Asp Ser 
205 

Gly Glu He Lys 



Leu Lys Leu Asp 
240 

Glu Ala Tyr Ala 
255 



Ser Gly Ser Lys 
260 



On page 11, line 21, after the word "gene" (second occurrence) but before the 
".", please insert -(SEQ. ID. No. 6)-. 

On page 12, line 20, after the word "gene" but before the ".", please insert — 
(SEQ. ID. No. 8)--. 

On page 13, line 9, after the word "is" and before ":", please insert -(SEQ. 

ID. No. 9)-. 

On page 16, line 23, change "in vitro" to — in vitro—. 

On page 16, line 33, change "a nd" to —and—. 

On page 18, line 31, change "teh" to —the—. 

On page 19, line 26, change "TSWV-BI" to — TSWV-BL--. 

On page 21, line 32, change "viron" to —virion—. 

On page 22, line 7, change "viron" to —virion—. 

On page 22, line 28, change "symptomless" to —Symptomless—. 

On page 22, line 33, change "cold" to —could—. 
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On page 23, line 18, change "viron" to -virion—. 

On page 24, line 8, change "symptom less" to —symptomless-. 

On page 26, line 22, before the insert - (SEQ. ID. No. 10) --. 

On page 26, line 33, before "complementary", insert (SEQ. ID. No. 1 1) 

On page 27, line 1 , change "anshor" to —anchor—. 

On page 27, line 2, before "that", insert - (SEQ. ID. No. 17) ~. 

On page 27, line 4, change "ampllified fragement" to -amplified fragment-. 

On page 27, line 7, before "close", insert - (SEQ. ID. No. 16) 

On page 27, line 28, after the word "is" but before the ":", please insert — 
(SEQ. ID. No. 14)-. 

On page 29, line 22, after the word "are" but before the ":", please insert — 
(SEQ. ID. No. 12)-. 

Please delete lines 23-38 on page 29, lines 1-44 on page 30, and lines 1-3 on 
page 31, and insert the following: 

Met Ser Ser Gly Val Tyr Glu Ser lie lie Gin Thr Lys Ala Ser Val 
15 10 15 

Trp Gly Ser Thr Ala Ser Gly Lys Ser lie Val Asp Ser Tyr Trp lie 
20 25 30 

Tyr Glu Phe Pro Thr Gly Ser Pro Leu Val Gin Thr Gin Leu Tyr Ser 
35 40 45 

Asp Ser Arg Ser Lys Ser Ser Phe Gly Tyr Thr Ser Lys lie Gly Asp 
50 55 60 

lie Pro Ala Val Glu Glu Glu He Leu Ser Gin Asn Val His He Pro 
65 70 75 ' 80 

Val Phe Asp Asp He Asp Phe Ser lie Asn He Asn Asp Ser Phe Leu 
85 90 95 

Ala He Ser Val Cys Ser Asn Thr Val Asn Thr Asn Gly Val Lys His 
100 105 110 

Gin Gly His Leu Lys Val Leu Ser Leu Ala Gin Leu His Pro Phe Glu 
115 120 125 

Pro Val Met Ser Arg Ser Glu He Ala Ser Arg Phe Arg Leu Gin Glu 
130 135 140 

Glu Asp He He Pro Asp Asp Lys Tyr He Ser Ala Ala Asn Lys Gly 
145 150 155 160 

Ser Leu Ser Cys Val Lys Glu His Thr Tyr Lys Val Glu Met Ser His 
165 170 175 
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Asn Gin Ala Leu Gly Lys Val Asn Val Leu Ser Pro Asn Arg Asn Val 
180 185 190 

His Glu Trp Leu Tyr Ser Phe Lys Pro Asn Phe Asn Gin lie Glu Ser 
195 200 205 

Asn Asn Arg Thr Val Asn Ser Leu Ala Val Lys Ser Leu Leu Met Ala 
210 215 220 

Thr Glu Asn Asn He Met Pro Asn Ser Gin Ala Phe Val Lys Ala Ser 
225 230 235 240 

Thr Asp Ser His Phe Lys Leu Ser Leu Trp Leu Arg He Pro Lys Val 
245 250 255 

Leu Lys Gin He Ala He Gin Lys Leu Phe Lys Phe Ala Gly Asp Glu 
260 265 270 

Thr Gly Lys Ser Phe Tyr Leu Ser He Ala Cys He Pro Asn His Asn 
275 280 285 

Ser Val Glu Thr Ala Leu Asn Val Thr Val He Cys Arg His Gin Leu 
290 295 300 

Pro He Pro Lys Ser Lys Ala Pro Phe Glu Leu Ser Met He Phe Ser 
305 310 315 320 

Asp Leu Lys Glu Pro Tyr Asn Thr Val His Asp Pro Ser Tyr Pro Gin 
325 330 335 

Arg He Val His Ala Leu Leu Glu Thr His Thr Ser Phe Ala Gin Val 
340 345 350 

Leu Cys Asn Lys Leu Gin Glu Asp Val He He Tyr Thr He Asn Ser 
355 360 365 

Pro Glu Leu Thr Pro Ala Lys Leu Asp Leu Gly Glu Arg Thr Leu Asn 
370 375 380 

Tyr Ser Glu Asp Ala Ser Lys Lys Lys Tyr Phe Leu Ser Lys Thr Leu 
385 390 395 400 

Glu Cys Leu Pro Val Asn Val Gin Thr Met Ser Tyr Leu Asp Ser He 
405 410 415 

Gin He Pro Ser Trp Lys He Asp Phe Ala Arg Gly Glu He Arg He 
420 425 430 

Ser Pro Gin Ser Thr Pro He Ala Arg Ser Leu Leu Lys Leu Asp Leu 
435 440 445 

Ser Lys He Lys Glu Lys Lys Ser Leu Thr Trp Glu Thr Ser Ser Tyr 
450 455 460 

Asp Leu Glu 
465 

On page 31, line 4, after the word "and", please insert — (SEQ, ID. No. 13)- 
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On page 31, line 41, after the word "is" but before the ":", please insert - 
(SEQ.ID.No. 15)--. 

Please delete lines 42-43 on page 31 and lines 1-16 on page 32, and replace 
with the following: 

TTATGCAACA CCAGCAATCT TGGCCTCTTT CTTAACTCCA AACATT T CAT AGAATTTGTC 60 

AAGAT TATCA CTGTAATAGT C CAT AG CAAT GCTTCCCTTA GCATTGGGAT TGCAAGAACT 120 

AAGTATCTTG GCATATTCTT TCCCTTTGTT TATCTGTGCA T CAT C CATTG TAAATCCTTT 18 0 

GCTTTTAAGC ACTGTGCAAA CCTTCCCCAG AGCTTCCTTA GTGTTGTACT TAGTTGGTTC 240 

AATCCCTAAC TCCTTGTACT TTGCATCTTG AT AT AT GGCA AGAACAACAC TGATCATCTC 3 00 

GAAGCTGTCA ACAGAAGCAA TGAGAGGGAT ACTACCTCCA AG C ATTATAG CAAGTCTCAC 36 0 

AGATTTTGCA TCTGCCAGAG GCAGCCCGTA AGCTTGGACC AAAGGGTGGG AGGCAATTTT 420 

TGCTTTGATA ATAGCAAGAT TCTCATTGTT TGCAGTCTCT TCTATGAGCT TCACTCTTAT 4 80 

CATGCTATCA AGCCTCCTGA AAG TCATATC CTTAGCTCCA ACTCTTTCAG AATTTTTCTT 54 0 

TAT CGTGAC C TTACCAAAAG TAAAATCACT TTGGTTCACA ACTTTCATAA TGCCTTGGCG 60 0 

ATTCTTCAAG AAAGTCAAAC ATGAAGTGAT ACTCATTTTC TTAATCAGGT CAAGATTTTC 6 60 

CTGACAGAAA GTCTTAAAGT TGAATGCGAC CTGGTTCTGG TCTTCTTCAA ACTCAACATC 72 0 

TGCAGATTGA GTTAAAAGAG AGACAATGTT TTCTTTTGTG AGCTTGACCT TAGACAT 777 

The complementary nucleic acid molecule has a nucleotide sequence as shown 
in SEQ. ID, No. 19.--. 

On page 32, line 24, change "vial" to —viral--. 
On page 33, line 29, change "the" to —The—. 
On page 33, line 32, change "differs" to —differ--. 

On page 34, line 17, change "exactly same positions" to —exact same position- 
On page 35, line 12, change "event" to —event,—. 
On page 35, line 13, change "Tospovirus" to —Tospovirus—. 
On page 36, line 10, change "generated" to —generate—. 
On page 36, line 27, change "Np" to --NP-. 
On page 39, line 16, after "virus" insert -was-. 
On page 39, line 17, change " 5 and is" to -and was—. 
On page 40, line 25, change "resistant" to -resistance-. 
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On page 41 , line 3, before "and", insert -- (SEQ. ID. No. 20) 
On page 41 , line 4, before " ,", insert - (SEQ. ID. No. 21) 
On page 41, line 10, after "plants" insert —that—. 
On page 41, line 11, change "greenhouse. All" to —greenhouse, all—. 
On page 43, line 9, after "appearance" insert 
On page 43, line 15, change "pants" to —plants—. 
On page 45, line 19, change "vecor" to — vector 
On page 45, line 33, before ",", insert - (SEQ. ID. No. 22) 
On page 45, line 35, before ",", insert - (SEQ. ID. No. 23) --. 
On page 46, line 31, before "was", insert - (SEQ. ID. No, 24) -. 
On page 47, line 11, change "Hewlet Scanjet" to —Hewlett Scanjet™—. 
On page 51, line 2, change "Tospoviruses" to —Tospoviruses—. 
On page 53, line 11, change "Tospo virus" to —Tospovirus--. 
On page 53, line 14, change "realated" to —related—. 
On page 53, line 16, change "RNE" to - RNA --. 
On page 53, line 28, change "generqated" to —generated—. 
On page 54, line 2, before "/', insert - (SEQ. ID. No. 25) 
On page 54, line 4, before "for", insert - (SEQ. ID. No. 26) -. 
On page 54, line 6, before "for", insert - (SEQ. ID. No. 27) -. 
On page 54, line 10, before "which", insert - (SEQ. ID. No. 28) 
On page 54, line 12, before "for", insert - (SEQ. ID. No. 29) 
On page 54, line 15, before "for", insert - (SEQ. ID. No. 30) - and chan< 
"untranslatablesecond" to — untranslatable second 

On page 54, line 33, change "expressin" to —expression—. 

On page 55, line 23, change "fragemnts" to — fragments— . 

On page 55, line 32, change "fragements" to -fragments—. 

On page 55, line 30, change "prodcing" to — producing --. 

On page 55, line 34, change "expressin" to -- expression 

On page 56, line 4, change "trnscription" to —transcription—. 

On page 56, line 6, change "pant" to — plant — and change "resluting" to 

resulting 

On page 56, line 32, change "variojs" to — various 
On page 57, line 23, change "portin" to — portion — . 
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Please delete the sequence listing from the specification, page 57, line 26 to 
page 75, line 9 and insert the new sequence listing (attached hereto) as a separately paginated 
document which is part of the present application. 

In the Claims : 

Please delete claims 1-3 and add new claims 4-18 as follows: 

—4. An isolated DNA molecule capable of transcription to a messenger 
RNA which is modified from a form encoding a nucleocapsid protein of an L serogroup 
Tospovirus so that it does not translate to the nucleocapsid protein, wherein, when the DNA 
molecule is transformed into a plant cell, it is capable of being transcribed into messenger 
RNA which exists at low level density readings of 15-50 as measured using a Hewlett 
ScanJet and Image Analysis Program.— 

—5. A DNA molecule according to claim 4, wherein the L serogroup 
Tospovirus is selected from the group consisting of TSWV-10W and TSWV-BL.— 

—6. A recombinant DNA expression system comprising an expression 
vector into which is inserted a heterologous DNA molecule according to claim 4.— 

—7. A plant cell transformed with a heterologous DNA molecule according 

to claim 4.— 

—8. A transgenic plant containing the DNA molecule according to 

claim 4.— 

—9. A method of treating a plant cell comprising: 

transforming a plant cell with a DNA molecule according to claim 4 

and 

transcribing the DNA molecule under conditions effective to maintain 
the messenger RNA in the plant cell at low level density readings of 15-50, as measured 



RI34689.1 



Continuation of 08/495,484 

-10- 
using the Hewlett ScanJet and Image Analysis Program, wherein the plant cell acquires 
resistance to an L serogroup Tospovirus - 

-10. A method of imparting to a plant cell resistance to infection by an I 
serogroup Tospovirus, said method comprising: 

transforming a plant cell with a DNA molecule encoding a 
nucleocapsid protein of an L serogroup Tospovirus, wherein the DNA molecule is expressed 
to produce an ELISA level of OD405nm = 0.50 to 1.00 of the nucleocapsid protein in the 
plant cell, as measured using an antibody raised against the nucleocapsid protein of 
Tospovirus isolate TS WV-BL, and the plant cell acquires resistance to an I serogroup 
Tospovirus. — 

-11. A method according to claim 1 0, wherein the L serogroup Tospovirus 
is selected from the group consisting of TSWV-10W and TSWV-BL.-- 

-12. A method according to claim 10, wherein the I serogroup Tospovirus is 
selected from the group consisting of INSV-Beg and INSV-LL- 

-13. A method of imparting to a plant cell resistance to infection by L 
serogroup Tospoviruses, said method comprising: 

transforming a plant cell with a DNA molecule encoding a 
nucleocapsid protein of an L serogroup Tospovirus, wherein the DNA molecule is expressed 
to produce an ELISA level of OD 405nm - 0.02 to 0.20 of the nucleocapsid protein in the 
plant cell, as measured using an antibody raised against the nucleocapsid protein of 
Tospovirus isolate TSWV-BL, and the plant cell acquires resistance to the L serogroup 
Tospovirus and a second L serogroup Tospovirus - 

-14. A method according to claim 13, wherein the L serogroup Tospovirus 
is selected from the group consisting of TSWV-10W and TSWV-BL.-- 

-15. A method of imparting to a plant cell resistance to infection to a 
Tospovirus comprising: 
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transforming a plant cell with a DNA molecule encoding a 



nucleocapsid protein or polypeptide of a serogroup L Tospovirus under conditions effective 
to render the plant cell resistant to infection by serogroup L Tospovirus isolates and a 
serogroup 2 Tospovirus isolate.-- 

-16. A method according to claim 1 5, wherein the serogroup 2 Tospovirus 
isolate is a TSWV-B Tospovirus isolate.-- 



-17. A transgenic plant containing a heterologous DNA molecule encoding 



a nucleocapsid protein or polypeptide of a serogroup L Tospovirus, wherein the transgenic 
plant, upon challenge with both a Tospovirus isolate belonging to serogroup L and a 
Tospovirus isolate belonging to serogroup 2, exhibits resistance to both the L serogroup 
Tospovirus isolate and the serogroup 2 Tospovirus isolate.- 

-18. A transgenic plant according to claim 17 5 wherein the serogroup 2 
Tospovirus isolate is a TSWV-B Tospovirus isolate.- 
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TOMATO SPOTTED WILT VIRUS 

Viruses In the Tospovirus genus Infect a wide variety of plant 
species, particularly tobacco, peanut, vegetables and ornamental plants. 
5 two virus species) tdmale Spotted wilt virus (TSWV) and impatiens 
hecrdtl6 spot virus (INSV) &fe recognized Within the Tospovirus genus. 

tomato Shotted Wilt Virus (TSWV) is unique among plant viruses 
in that the nucleic acid-protein complex is covered by a lipoprotein 
envelope end It is the only thrip transmitted virus. This virus has 
1 0 recently been classified as the Tospovirus genus df the Bunyaviridae 
family. TSWV virions contain a 29K nucleocapsid protein ("NP" or "N"), 
two membrane-associated glycoproteins (58K and 78K) and a large 
200K protein presumably tor the viral transcriptase [see J. Gen. Virol. 
71:2207 (1991); Virol. 56:1 2 (1973); and J. Gen. Virol. 36:267 (1977)]. 
1 5 The virus genome consists of three negative-strand (-) RNAs designated 
L RNA (8900 nucleotides), M RNA (5400 nucleotides) and S RNA (2900 
nucleotides) Isee J. Gen. Virol. 36:81 (1977); J. Gen. Virol. 53:12 (1981); 
and J. Gen. Virol. 70:3469 (1989)], each of which is encapsulated by the 
NP. The partial or full-length sequences of S RNAs from three TSWV 
20 isolates reveals the presehce of two open reading frames (ORF) with an 
ambisense gene arrangement fsee J. Gen Virol. 71:1 (1990) and J. Gen. 
Virol. 72:461 (1991)]. The larger open reading frame is located on the 
viral RNA strand and has the capacity to encode a 52K nonstructural 
protein. The smaller ORF is located on the viral complementary RNA 
25 strand and is translated through a subgenomic RNA into the 29K NP. 

The ambisense coding strategy is also characteristic of the 
TSWV M RNA, With the open reading frames encoding the 58K and 78K 
membrarie-associated glycoproteins. The TSWV L RNA has been 
sequenced to encode a large 200K protein presumably for the viral 
3 0 transcriptase. 

Two TSWV serogroups, "L" and "I", have been identified and 
characterized based on serological analysis of the structural proteins 
and morphology of cytopathic structures (see J. Gen Virol. 71:933 
(1990) and Phytopathology 81:525 (1991)]. They have serologically 
3 5 conserved G1 and G2 glycoproteins, but the NP of the "I" serogroup is 
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serologically distinct from that of the V serogroup. Comparison of the 
HP bdtween the "L* &hd 1* serogroups has shown 62% and 67% Identities 
at nucleotide and amino acid levels, respectively [see J. Gen. Virol. 
72:2597 (1991)]. 

5 TSWV has a Wide host range, Infecting more than 360 plant 

species of 50 families and causes significant economic losses to 
vegetables and drnartiehtal plants worldwide. The V serogroup has 
been found extensively In field crops such as vegetables and Weeds, 
While the "I" serogroup has been largely confined to ornamental crops. 
10 A cucurbit isolate has recently been identified [se'e v Plant Disease 

68:1006 (1984)] as a distinct isolate because it systemically infects 
watermelon and other curcurbits and its UP is serologically unrelated 
to that of either serogroup. Although the spread of the TSWV disease 
can sometimes be reduced by breeding resistant plants or using non- 

1 5 genetic approaches, complete control of the disease by these 

conventional methods has generally proven to be difficult [see Plant 
Disease 73:375 (1989)]. 

Since 1986, numerous reports have shown that transgenic plants 
with the coat protein (CP) gene of a virus are often resistant to 
20 infection by that virus. This phenomenon is commonly referred to as 
coat protein-mediated protection (CPMP). The degree of protection 
ranges from delay in symptom expression to the absence of disease 
symptoms and virus accumulation. Two recent independent reports 
[see Biol. Technology 9:1363(1991) and Mol. Plant-Microbe Interact. 

2 5 5:34 (1992)} showed that transgenic tobacco plants expressing the 

nucleocapsid protein (NP) gene of TSWV are resistant to infection by 
the homologous isolate. However, since TSWV is widespread with many 
biologically diverse isolates, it is very important to test the 
effectiveness of the transgenic plants to resist infections by different 

3 0 TSWV isolates. The findings of the present invention expand on those of 

the previous reports by demonstrating that transgenic plants according 
to the present invention Showed resistance to two heterologous 
isolates of the *L" serogroup and an isolate of the "I" serogroup." We 
also show that resistance to the two heterologous isolates of the "L H 
3 5 serogroup was mainly found in plants accumulating very low, if any, 



levels of NP, while transgenic plants that accumulated high levels of NP 
were resistant to iha Isblate of the "I" serogroup. 

However, no resistance was observed to a Brazilian isolate, 
although tha plants that accumulated high levels of the N protein did 
5 display a delay in symptohi expression. This Brazilian isolate, 

designated TSWV-6 Mi tha N protein that was Serologically distinct 
frohl the 1* and Sfeftigrotips and biologically differs from a curcurbit 
isolate in that the TSWV-fe does not systemically Infect melons or 
squash . Therefore, 6h§ aspect of the present Invention is to 
1 0 characterize the TSWV-B by cloning and sequencing of its S RNA and 
comparisons With the published sequences of other TSWV isolates. 

Various aspects of the present invention Will become readily 
apparent from the detailed description of the present invention 
including the following example, figures and data. 
15 In the Figures; 

Fig. 1 depicts the strategy for cloning the NP gene from viral RNA 
according to the present invention; 

Fig. 2 depicts the in vivo transient expression of the nucleocapsid 
protein (NP) gene of tomato spotted wilt virus according to the present 
20 invention in. tobacco protoplasts; 

Fig. 3 depicts the location of the sequenced cDNA clones in the 
TSWV-B S RNA according to the present invention; 

Fig. 4 depicts a dendogram showing relationships among TSWV 
isolates according to the present invention; 

2 5 Fig. 5 depicts the serological relationship of TSWV isolates 

described herein; 

Fig. 6 depicts the correlation of the level of nucleocapsid protein 
(NP) accumulation in transgenic plants with the degree of resistance to 
TSWV isolates; 

3 0 Fig. 7 depicts the TSWV-BL N coding sequences introduced into 

transgenic plants in accordance with one aspect of the present 
invention; and 

Fig. 8 depicts the TSWV-BL half N gene fragments introduced into 
plants in accordance with one aspect of the present invention. 
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More specifically, figure 2 depicts transient expression of the NP 
gene In which the constructs were transferred into tobacco mesophyll 
protoplasts using polyethylene glycol (PEG). The transformed 
protoplasts were subsequently incubated for two days for the 
expression of the UP gehfi. Proteins Were extracted from the 
protoplasts and tested fdf the NP by double, antibody sandwich enzyme- 
linked Immunosorbent assay (DAS-ELISA) using antibodies against the 
TSWV NP. UP' and NP+ represent the protoplasts transformed with 
plasmlds P&I525-NP- and pBI525-NP+, respectively. Concentration of 
the aritibodies for coating: 5 |ig/ml: dilution of the enzyme conjugate: 
1 :250. Data were taken 30, 60 and 90 min. after addition of substrate. 

In figure 3, the five overlapping cDNA clones are shown to scale 
below a S RNA map of TSWV-B. These clones were synthesized with 
random primers from double-stranded RNA isolated from N. benthamiana 
plants Infected with TSWV-B. 

In figure 4, the sequences were compared using the pileup 
program of the GCG Sequence analysis software package. Horizontal 
lines are proportional to the genetic distance while vertical lines are 
of arbitrary length and have no significance. 

More specifically, In figure 5, N. benthamiana Domin. were 
infected with TSWV isolates [TSWV-BL (a lettuce Isolate), Arkansas, 
10W pakchoy (TSWV-10W), Begonia, and Brazil (TSWV-B)). An infected 
leaf disc (0.05 gram) was ground in 12 ml of the enzyme conjugate 
buffer and analyzed by DAS-ELISA using antibodies raised against 
TSWV-BL viron (BL viron), or the NP of TSWV.BL (BL-NP), or TSWV-I (I- 
NP). Concentration of antibodies for coating were 1u.g/ml; dilution of 
conjugates were 1:2000 for BL viron, 1:250 for BL-NP, and 1:1000 for l- 
NP. The results were taken after 10 minutes (BL), 50 minutes (BL-NP), 
or 30 minutes after adding substrate. 

With regard to figure 6, transgenic plants were assayed in DAS- 
ELISA for NP accumulation With antibodies raised against the NP of 
TSWV-BL. Plants were read 150 min. after adding substrate and the 
transgenic plants were grouped into four categories: OD405 nm * smaller 

than 0.050, OD405nm between 0.050 to 0.200, OD405nm between 0.200 
to 0.400, and OD4o5nm greater than 0.400. The OD405nm readings of 
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control NP {-) plants Were from zero to 0.05. The same plants were 
challenged with either the Arkansas (Ark) and 10W pakchoy (10W) 
isolates or the Begdlila Isolate and the susceptibility of each plant was 
recorded about 12 days after inoculation. The results were pooled from 
5 fitty-ofte Rl NP (+) plarit§ inoculated with the Arkansas and 10W 

pakchoy isolates and one hundred thirty-nine Rt NP(+) plants inoculated 
With the Begonia isolate. Numbers above bars represent total numbers 
of Ri NP(+) plants tested. 

EXAMPLE I 
10 isolation of TSWV-BL RNAs: 

the TSWV-BL isolate was purified from Datura stramonium L. as 
follows: the infected tissues were ground in a Waring Blender for 45 
sec with three volumes of a buffer (0.033 M KH2PO4, 0.067 MK2HPO4, 
0.01 M Na2S03). The homogenate was filtered through 4 layers of 

1 5 cheesecloth moistened With the above buffer and centrifuged at 7,000 

rprrt for 15 min. The pellet Was resuspended in an amount of 0.01 M 
Na2S03 equal to the original weight of tissue and centrifuged again at 
8,000 rpm for 15 min. After the supernatant was resuspended in an 
amount of 0.01 M Na2S03 equal to 1/10 of the original tissue weight. 
20 The Virus extract was centrifuged at 9,000 rpm for 15 min. and the 
supernatant was carefully loaded on a 10-40% sucrose step gradient 
made up in 0.01 M Na2S03. After centrifugation at 23,000 rpm for 35 
min., the virus zone (about 3 cm below meniscus) Was collected and 
diluted with two volumes of 0.01 M Na2S03. The semi-purified virus 

2 5 was pelleted at 27,000 rpm for 55 min. 

EXAMPLE II 

Purification of TSWV and viral RNAs: 

The TSWV-BL isolate (see Plant Disease 74:154 (1990)] was 
purified from Datura strarrionium L, as described in Example I. The 

3 0 purified virus was resuspended in a solution of 0,04% of bentonite, 10 

ng/fnl of proteinase K, 0.1 M ammonium carbonate, 0.1% (w/v) of 
sodium diethyldithiocarbanate, 1 mM EDTA, and 1% (w/v) of "sodium 
dodecyl sulfate (SDS), incubated at 65°C for 5 min., and immediately 
extracted from H20-saturated phenol, followed by another extraction 
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with chloroform/isoamy! alcohol (24:1). Viral RNAs were precipitated 
in 2.5 volumes of ethahol and dissolved in distilled H2O. 

EXAMPLE III 

cDNA and PCR-based HP gehe cloning: 
5 The first strand cbNA Was synthesized from' purified TSWV-BL 

RNAs using random primers as described by Gubler and Hoffman tsee 
Gehe 25:263 (1983)]. The Second strand Was produced by treatment of 
the sample with RNase H/DNA polymerase. The resulting, double- , 
stranded cDNA sample Was size-fractionated by sucrose gradient 
10 centrifugation, methylated by EcoRI methylase, andvEcoRI linkers were 
added. After digestion with EcoRI, the cDNA sample was ligated into 
the EcoRI site of pUC18, whose 5'-terminal phosphate groups were 
removed by treatment With calf intestinal alkaline phosphotase. E coli 
DH5 a competent cells (Bethesda Research Laboratories) were 
1 5 transformed and clones containing TSWV cDNA inserts Were first 
selected by plating on agar plates containing 50 fig/ml of ampicillin, 
IPTG, and X-gal. Plasmid DNAs from selected clones were isolated 
using an alkaline lysis procedure {see BRL Focus 11:7 (1989)], and the 
insert sizes were determined by EcoRI restriction enzyme digestion 
20 followed by DNA transfer onto GeneScreen Plus nylon filters (DuPont). 
Plasmid clohes that contained a TSWV-BL S RNA cDNA insert were 
identified as described below by hybridizing against a 32p-labelled 
oligomer (AGCAGGCAAAACTCGCAGAACTTGC) complementary to the 
nucleotide sequence (GCMGTTCTGCGAGTTTTGCCTGCT) of the TSWV- 
25 CPNH1 S RNA [see d. Gen. Virol. 71:001 (1990)]. Several clones were 
identified and analyzed on agarose gels to determine the insert sizes. 
The clones pTSWVS-23 was found to contain the largest cDNA insert, 
about 1.7 kb in length. 

The full-length NP gene was obtained by the use of polymerase 
3 0 chain reaction (PCR). First-strand cDNA synthesis was carried out at 
37°C for 30 mln. in a 20 U.I reaction mixture using oligomer primer 
JLS90-46 (5'-> 3') AGCTAACCATGGTTAAGCTCACTAAGGAAAGQ (also 
used to synthesize the nucleocapsid gene of TSWV-10W) which is 
complementary to the S RNA in the 5' terminus of TSWV NP gene 
3 5 (nucleotide positions 2751 to 2773 of the TSWV-CPNH1). The reaction 
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mixture contained 1.5 |ig of viral RNAs,1 u,g ot the oligomer primer, 0.2 
mM of each dNTP, IX PGR buffer (the GeneAmp kit, Perkin-Elmer-Cetus), 
20U of RNAs in RibonUclease inhibitor (Promega), 2.5 mM of MgCl2, and 
25U of AMV reverse transcriptase (Promega Corporation). The reaction 
5 was terminated by heating at 95°C for 5 min. and cooled on ice. Then 10 
lit of the cDNA/RNA hybrid Was used to PCR-amplify the HP gene 
according to manufacturer's instructions {Peridn-Elrrier-Cetus) using 1 
jig each of oligomer primers JLS90-46 and JLS90-47 (5'->3'), 
AGGATTCCATGGTTAAcAgACTAAGCAAGCAC (also used to synthesize the 

1 0 nucleotide gene of f SW\M0W), the latter oligomer being identical to 
the S RNA in the 3' honcoding region of the gene (nucleotide positions 
1919 to 1938 of the TSWV-CPNH1). A typical PCR cycle was 1 min. at 
92°G (denaturing), 1 min. at 50°C (annealing), and 2 min. at 72°C 
(polymerizing). The sample was directly loaded and separated on a 1.2% 

1 5 agarose gel. The separated HP gene fragment was extracted from the 
agarose gel, ethanol-precipitated and dissolved in 20 ut of distilled 
H2O. 

EXAMPLE IV 

Construction of plant expression and transformation vectors. 

2 0 the gel-isolated HP gene fragment from Example III was digested 

with the restriction enzyme rVco/ in 50 fxl of a reaction buffer [50 mM 
Tris-HCI (pH 8.0), 10 mM MgCl2, 0.1 M NaCI] at 37°C for 3 hours, and 
directly cloned into A/co/-digested plant expression vector pB1525. The 
resulting plasmids were Identified and designated as pB1525-NP+ in 

2 5 the sense orientation relative to the cauliflower mosaic virus (CaMV) 

35S promoter, and as pB1525-NP"in the reverse orientation. The 
ability of this expression cassette to produce the HP was determined by 
transient expression of the HP gene in Nicotiana tobacum protoplasts, 
as described by Pang et al [see Gene 112:229 (1992)]. The expression 
. 3 0 cassette containing the HP gene was then excised from pB1525-NP + by 
a partial digestion with /-//ndlll/EcoRI (since the HP gene contains 
internal Hihd\\\ and EcoRI sites), and ligated into the plant K 
transformation vector pBlNl9 (Clontech Laboratories, Inc.) that had 
been cut with the same enzymes. The resulting vector, pBIN19-NP + and 

3 5 the control plasmid pBlNl9 were transferred to A. tumefaciens strain 



LBA4404, using the procedure described by Holsters et al [see Mol. Gen. 
Gerlet. 163:181 (1978)]. 

Nucleotide sequence analyses of the inserts in clones pTSWV-23 
and Pb1525-NP+were determined using the dideoxyribonucleotide 
5 fnethSd, T7 polymerase (U.S. Biochemicals, SeqU6riase T ^), and the 
double-strahded sequencing procedure described by Siemieniak et al 
[see Analyt. Biochem. 192:441 (1991)1. Nucleotide sequences were 
determined from both DNA strands and this information was compared 
With the published Sequences of TSWV isolates CPNH1 using computer 
1 0 programs available from the Genetics Computer Group (GCG, Madison, 
Wl). 

Transient expression of the NP gene in tobacco protoplasts were 
also prepared. Plasmid DNAs for clones pTSWVS-23 and pUC18cpphas 
TSWV-NP (containing the PCR-engineered NP gene insert) were isolated 
1 5 using the large scale alkaline method. The PCR-engineered NP gene 
insert was excised from clone pBIS25-NP+ by Ncol digestion to take 
advantage of the available flanking oligomer primers for sequencing. 
The expression cassette pUC18cpphas is similar to pUC18cpexp except 
that it utilizes the poly(A) addition signal derived from the Phaseolus 

2 0 vulgaris seed storage gene phaseolin. These plasmid DNAs were 

subjected to two CsCI-ethidium bromide gradient bandings, using a 
Beckman Ti 70.1 fixed angle rotor. DNA sequences were obtained using 
dideoxyribonucleotides and the double-stranded plasmid DNA sequencing 
procedure described above. Nucleotide sequence reactions were 
25 electrophoresed on one-meter long thermostated (55°C) sequencing gels 
and nucleotide sequence readings averaging about 750 bp were obtained. 
Nucleotide sequences were determined from both DNA strands of both 
cloned inserts to ehsure accuracy. Nucleotide sequence information 
from the TSWV-BL S RNA isolate was compared as discussed below, 

3 0 with TSWV isolates CPNH1 and L3 using computer programs (GCG, 

Madison, Wl). 

The nucleotide and deduced amino acid sequences of cloned cDNA 
and PCR-engineered insert of TSWV-BL S RNA and their comparison 
with the nucleotide sequence of TSWV-CPHN1 S RNA are shown below. 
3 5 The nucleotide sequence of the TSWV-BL S RNA clones pTSWVS-23 
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(TSWV-23) and pBI525-NP+ (TSWV-PCR) were obtained using the 
double-stranded dideoxynucleotide sequencing procedure of Siemieniak, 
and their sequences are compared with the relevant regions of the 
nucleotide sequence tif the TSWV-CPNH1 S RNA reported in GeneBank 
Accession No. D06645. the nucleotide sequence oftSWV-CPNHl S RNA 

has been reported by be Haan (1990) and is represented by the following 
sequence: 

O&GTTGAAA QCAACAACAG MCTGTAAAT TCTCTTGCAG TGAAA1UTCT , 50 

GCTCATGTCA GCAGAAAACA ACATCATGCC TAACTCTCAA GCTTCCACTG 100 

AITCTCRTTr CAAGCTGAGC CTCIGGCTAA GGGTTCCAAA ' GGTITI'GAAG 150 

CAGGTTTCCA TTCAGAAMT GTTCAAGGET GCAGGAGATG AAACAAACAA 200 

AACATCTIAT TTATCTATTG CCTGCA1TCC AAACCATAAC AGTGTTGAGA 250 

CAGCTTTAAA CATEACTGTT ATITGCAAGC ATCAGCTCCC AAITCGCAAA 300 

TGCAAAGCTC CTTTTGAATr ATCAATGATG TTTTCTGATT TAAAGGAGCC 350 

TTACAACATT GTTCATSAOC CTICATACCC CAAAGGATCG GITOCAATQC 400 

TCTQGCTCGA MCTCACACA TCTTTGCACA AGITCTITGC MCTAACTTG 450 

CAAGAAGATG TAATCATCIA CACTTTGAAC AACCTTGAGC TAACTCCTGG 500 

AAAGTTAGAT TTAQGTGAAA GAACdTGAA TmCAGTGAA GATGCCTACA 550 

AAAGGAAAIA TITCCTITCA AAAACACTTG AATGTCTTCC ATCIAACACA 600 

CAAACTATGT CTIMTAGA CAGCATCCAA ATOCCTTCAT GGAAGATAGA 650 

CTTT3CCAGA GGAGAAATTA AAATITCTCC ACAATCTATT TCAGTTQCAA 700 

AATCTTTGTT AAAQCTTGAT TTAAGCGGGA TCAAAAAGAA AGAATCTAAG 750 

GTEAAGGAAG CGTATGCTTC AGGATCAAAA TAATCXTGCT TTGTCCAGCT 800 

TTTrCTAATT ATGTTATGTT TATTITCTIT CTITACTTAT AATTATITCT 850 

CTGTITGTCA TCTCTITCAA ATTCCTCCTG TCIAGTAGAA ACCATAAAAA 900 

CAAAAAATAA AAATGAAAAT AAAATEAAAA TAAAATAAAA TCAAAAAATG 1000 

AAATAAAAAC AACAAAAAAT TAAAAAAOGA AAAAQCAAAA AGACCCGAAA 1050 

GGGACCAATT TGGGCAAATT TGGGTTTTGT TmGTi'i'iT TGriTlTlGT 1100 

TITTTATITr TrATTITATT TTTATITTAT TTEAlTiTiA TTTTMTTTT 1150 

ATITEATITA TiTiTl^TiT TCGTTGTTTT TGTTATTTTA TEATITATTA 1200 

AGCACAACAC ACAGAAAGCA AACTTTAA3T AAACACACTT ATTTAAAATT 1250 

-T^CADO^.AGX^AAGCJCA AGCAATAAAG ATAAAGAAAG C1TEA1A1AI. ^ 1300 

TEATAGGCTT TITIATAAIT TAACITACAG CT3CTTTCAA GCAAGTTC1G 1350 

CGAGTnTGC CTGCTITTTA ACCCGGAACA TITCATAGM CTTGTTAAGA 1400 

GTITCACTGT AATGITCCAT AGCAACACTC CCITEAGCAT TAGGATTGCT 1450 
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GGAGCTAAGT ATAGCAGCAT ACICITTCCC CTTCTTCACC TGATCTTCAT 1500 
KMTTCAAA TGCTITGCTT 1TCAGCACAG TGCAAACITT TCCTAAGGCT 1550 
TOCTTGGTGT OfflACTIX*n? TGGGTCGATC COGAGGTCCT TGTA1TITGC 1600 
ATCCTGATAT ATAGOCAAGA CAACACTGAT CATCTCAAAG CTATCAACTG 1650 
5 AAGCAAIAAG AGGTAAGCTA CCTCOCRGCA TIATGGCAAG TCTCACAGAC 1700 
TTTGGtoCAT GGAGAGGTAA TCCATAGGCT TGAATCAAAG GATQGGAAGC 1750 
AATCITAGAT TIX3ATAGTAT TGAGATTCTC AGAATTOCCA WITILTICAA 1800 
CAAGOCTGAC CCTGATCAAG CTATCAAGCC TDCTGAAGGT CATGTCAGTG 1850 
CCTGCAATOC TGTCTGAAGT TllLTl'IATG GTAATTTTAC CAAAAGTAAA 1900 

1 0 ATCGCTTTGC TIAATAACCT TGATEATGCT CTGACGATTC TTTAGGAATG 1950 
TCAGACATGA AATAACGCTC ATCTTCTTGA TCIGGTQGAT GTHTCCAGA 2000 
CAAAAAGTCT TGAAGTTGAA TGCTACCAGA TTCTGATCTT OCTCAAACTC 2050 
MGGTCTTTG (XTTGTGTCA ACAAAQCAAC AATGCTrT£g..TmGmGCT 2100 
.TAAtX.TEAGA CATGATGATC GTAAAAGTTG TEATAGCTTT GACOGTATGT 2150 

1 5 AACTCAAGGT GCGAAAGTGC AACTCTGTAT CCCGCAGTCG TITCTTAGGT 2200 
TCITAATGTG ATGAnTGTA AGACTGAGTG TTAACGTATG AACACAAAAT 2250 
TGACAOGATT GCTCT 2265 

The incomplete deduced amino acid sequence of the nonstructural 
protein gene on TSWV-CPNH1 S RNA is provided below beginning with 

20 nucleic acid at position 1 and ending with the nucleic acid codon ending 
at position 783: 

Gin Val Glu Ser Asn hm hzg HuT Val Asn Ser Leu Ala Val Lys 

5 10 15 

Ser Leu Leu Met Ser Ala Glu Asn Asn He Mat Pro Asn Ser Gin 
25 20 25 30 

Ala Ser Thr Asp Ser His Phe Lys Leu Ser Lsu Trp Leu Arg Val 
35 • 40 45 

Pro Lys Val Leu Lys Gin Val Ser He Gin Lys Leu Phe Lys Val 
50 55 60 

3 0 Ala Gly Asp Glu Thr ESQ Lys 2hr Phe Tyr Leu Ser He Ala Cys 

65 70 75 

He Pro Asn His Asn Ser Val Glu Thr Ala Leu Mn lie TDX Val 
80 * 85 90 

He Cys Lys His Gin Leu Pro He Arg Lys Cys Lys Ala Pro Phe 
3 5 95 100 ,105 

Glu Leu Ser Met Met Phe Ser Asp Leu Lys Glu Pro Tyr Asn" He 
110 115 120 

Val His Asp Pro Ser Tyr Pro Lys Gly Ser Val Pro Met Leu Trp 
125 130 135 
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Leu Glu Thr His Thr Ser Leu His Lys Phe Phe Ala Thr Asn Leu 

140 145 150 

Gin Glu Asp Val lie lie Tyr Thr Leu Asn Asn Leu Glu Leu Thr 

155 160 165 

5 Pro Gly Lys Leu Asp Leu Gly Glu Arg Thr Leu hsn Tyr Ser Glu 

170 175 180 

Asp Ala Tyr Lys Arg Asp Tyr Phe Leu Ser Lys Thr Leu Glu Cys 

185 190 195 

Leu Pro Ser Asn Thr Gin Thr Mat Ser Tyr Leu Asp Ser lie Gin 
10 200 205 210 

* 

He Pro Ser Trp Lys He Asp Phe Ala Arg Gly Glu He Lys He 
215 220 , 225 

Ser Pro Gin Ser He Ser Val Ala Lys Ser Leu Leu Lys Leu Asp 
230 235 240 

1 5 Leu Ser Gly He Lys Lys Lys Glu Ser Lys Val Lys Glu Ala Tyr 

245 250 255 

Ala Ser Gly Ser Lys 
260 

The nucleotide sequence for TSWV-23 depicted below compares 
20 closely with the TWSV sequence given above, and contains one-half of 
the nonstructural gene and one half of the nucleocapsid protein gene. 

AAATTCTCTT GCAGTGAAAT CTX3X3CTCAT GTTAGCAGAA AACAACATCA 50 

TGCCEAACTC TCAAGCTTTT GTCAAAGCTT CTA.CTGMTC TAATTTCAAG 100 

CTGAGCCTCT GGCTAAGGGT TCCAAAGGTT TTGAAGCAGA TTTCCATTCA 150 

2 5 GAAATTGTTC AA.GGTTGCAG GAGATGAAAC AAATAAAACA TTTIAITTAT 200 

cmrroccTG caitccaaac caiaacagtg ttgagacagc tttaaacatt 250 

ACTGTTATTT GCAAGCATCA GCTCCCAATT CGTAAATGTA AAACTOCTTT 300 

TGAATTATCA ATGATGTTTT CT3ATTTAAA GGAGCCTTAC AACATTATTC 350 

ATGATCCTTC ATATCCCCAA A&GATTGTTC ATGCTCTGCT TGAAACTCAC 400 

3 0 ACATCTTTTG CACAAGTTCT TTGCAACAAC TTGGAAGAAG ATGTGATCAT 450 

CTACACCITG AACAACCATG AGCTMCTCC TGGAAAGTTA GATTTAGGTG 500 

AAATAACTTT GAATTACAAT GAAGA0G0CT ACAAMGC^A ATATTTCCTT 550 

TCAAAAACAC TTGAATGTCT TCCATCTAAC ATACAAACEA. TGTCTTATTT 600 

AGACAGCATC CAAATCCCTT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA 650 

3 5 TTAAAATTTC TCCACAATCT ATTTCAGITG CAAAATCTTT GTTAAATCTT 700 

GATTTAAGCG GGATTAAAAA GAAAGAATCT AAGATTAAGG AAGCATATGC^ 750 

TTCAGGATCA AAATGATCIT GCTGTGTCCA GCTTTITCIA ATTATGTTAT 800 

GTTTATTTTC TTIUTTTACT TATMTTATT TTTCTGTTTG TCATTTCTTT 850 

CAAATTCCTC CTGTCTAGTA GAAACCATAA AAACAAAAAT AAAAATAAAA 900 
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. TAAAATCAAA ATAAAATAAA AATCAAAAAA TGAAATAAAA GCAACAAAAA 950 
AATTAAAAAA CAAAAAACCA AAAAAGATCC 03AAAGGACA ATTITGGCCA 1000 
AATTTGGGGT TIGTTTTTGT ITlTiUlTiT ITlWITiTiT GTTTTTATTT 1050 
TlAl'ilTTAT Til'iiAlTlTr AIT1TAT1TT AlTlTAlGTT TITGTTGTTT 1100 
5 TIGTTATTTT GTIATTTATT MGCACAACA CACAGAAAGCA. AALTITAAT 1150 
TAAACACACT TATT1AAAAT TIAACACACT AAGCAAGCACA AACAATAAA. 1200 
GATAAAGAAA GCTTKfflAIA TTTATAGGCT TTITEAIAAT TTAACTTACA 1250 
GCTGCT1T1A AfXAAGTTCT GTGAGTITTG CCFGTrmT MOCCCAAAC 1300 
ATITCATAGA ACTTGTTAAG GGTITCACTG TAATGTTCCA TAGCAATACT 1350 
1 0 TOCrrTAGCA TTAGGATTGC TGGAGCIMG TATAGCA^'TACICTTTCC 1400 
CCTlL'l'iCAC CTGATCTTCA TICATTTCAA ATGCT1TJ.CT TTTCAGCACA 1450 
GTGCAAACIT TTCCTAAGGC TTCOCTGGTG TCATAOTCT TTGGGTCGAT 1500 
CCCGAGATCC TTGTATTTTG CATCCTGATA TATAQCCAAG ACAACACTGA 1550 
TCATCTCAAA GCTATCAACT GAAGCAATAA GAGGTAAGCT ACCTCOCAGC 1600 

1 5 ATEATGGCAA GCCTCACAGA CTITGCATCA TCAAGAGGTA ATOCATAGGC 1650 

ITGAATCAAA GGGTGGGAAG CAATCITAGA TTTGATAGTA TEGAGATTCT 1700 
CAGAATTCC 1709 

The nucleic acid sequence for TSWV-PCR according to the present 
invention as depicted below also compares closely with the TSWV 
20 sequence given above and covers the whole nucleocapsid protein gene. 

TTMCACACT AAGCAAGCAC AAACAATAAA GATAAAGAAA GCTTTATATA 50 
TTTAIAGGCT TTITEAIAAT TTAACTTACA GCTGCTTTTA AGCAAGTTCT 100 
GrGAGITTTG CCTuiTlTIT AACCCCAAAC ATTTCATAGA ACTH3TTAAG 150 
GGTITCACTG TAATGTTCCA TAGCAATACT TCCTITAGCA TTAGGATTGC 200 

2 5 TGGAGCTAAG TATAGCAGCA TACTCTTTCC O-TltTiCAC CTGATCTTCA 250 

TTCATTTCAA ATGCTTTTCT TTTCAGCACA GTGCAAACTT TTCCTAAGGC 300 
TICCCIGGTG TCATACTTCT TTGGGTCGAT CCCGAGATCC TTGTATTTTG 350 
CATCCTGATA TATAGOCAAG ACAACACTGA TCATCTCAAA GCTATCAACT 400 
GAAGCAATAA GAGGTAAGCT ACCTCCCAGC ATEATGGCAA GCCTCACAGA 450 

3 0 CTTTGCATCA TCAAGAGGTA ATCCATAGGC TTGACTCAAA GGGTGGGAAG 500 

CAATCTTAGA TTTGATAGTA T1GAGATTCT CAGAATTCCC AGTTTCCTCA 550 
ACAAGCCTGA (XCTGATCAA GCTATCAAGC CTTCTGAAGG TCATGTCAGT 600 
GGCTCCAATC CTGTCTGAAG TITTCTTTAT GGTAATTTTA CCAAAAGTAA " 650 
AATCGCTTTG CTTAATAACC TTCATIATGC TCTGACGATT CTTCAGGAAT 700 
3 5 GTCAGACATG AAATAATGCT CATCTITTTG ATCTGGTCAA GGTTTTCCAG 750 
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ACAAAAAGTC TTGAAGTTGA ATGCIACCAG MTCTGATCT TCCTCAAACT 800 
CAAGGTC1TT GCCITGTGTC AACAAAGCAA CAATGCTTTC CFIAGTGAGC 850 
TTAAOCAT 858 

Together the dbrie'd tSWV-23 insert overlaps the TSWV-PCR 
5 insert* and together th§y represent the 2028 nucleotides of the TSWV- 
BL 6 RNA according to thS present invention. This 2028 nucleotide 
sequence according to the present invention contains a part of the 
nonstructural gene and whole nucleocapsid protein gene. The combined 
sequence is: 

1 0 AAATrcrcrr GCAGTGAAAT CTCTGCTCAT gttagcagaa' AACAACATCA 50 
TGCnEAACTC TGAAGCITIT GTCAAAGCIT CmCTGATTC TAATITCAAG 100 
CTGAGCCTCT GGCTAAGQGT TCCAAAGGTT TTGAAGCAGA TITCCATTCA 150 
GAAATTGTTC AAGGTTGCAG GAGATGAAAC AAATAAAACA 1T1TAT1TAT 200 
CTATTGCCTG CATTCCAAAC CATAACAGTG TTGAGACAGC TTTAAACATT 250 

1 5 ACTGTIATTT GCAAGCATCA GCTCOCAATT CGTAAATGTA AAACTCCTIT 300 

TGAATTATCA ATCATGTTTT CTGATTEAAA GGAGOCTTAC AACATIATTC 350 
ATGATCCITC AIATCCOCAA AGGATTGTTC ATGCTCTGCT TGAAACTCAC 400 
AGATCrTTTG CACAAGTTCT TTGCAACAAC TTGCAAGAAG ATGTGATCAT 450 
CTACACCTTG AACAACCATG AGCEAACTCC TGGAAAGTTA GATTTAGGTG 500 

2 0 AAATAACITT .GAATEACAAT GAAGACGCCT ACAAAAGGAA ATATTTCCTT 550 

TCAAAAACAC TTGAATGTCT TOCATCTAAC ATACAAACIA TGTCTTATIT 600 
AGACAGCATC CAAATCOCIT CCTGGAAGAT AGACITTGCC AGGGGAGAAA 650 
ITAAAATTTC TCCACAATCT ATTTCAGTTG CAAAATCITT GTEAAATCIT 700 
GATITAAGCG GGATTAAAAA GAAAGAATCT AAGATTAAGG AAGCATATGC 750 

2 5 TTCAQGATCA AAATGATCTT GCTGTGTCCA GLTITI'ICTA AJTATGTTAT 800 

GTTEATTITC TTTCTTTACT TATAATTATT TTTCTGTITG TCATTTCTIT 850 
CAAATTCCTC CTGTCEAGTA GAAACCATAA. AAACAAAAAT AAAAATAAAA 900 
TAAAATCAAA ATAAAATAAA AATCAAAAAA TGAAATAAAA GCAACAAAAA 950 
AATTAAAAAA CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGGOCA 1000 

3 0 AATTTGGGGT TTGTITTTGT 'iTiTMTiT TTTGrnTlT GlTTrTATIT 1050 

TTAlTiTiAT TTITATITIT AITiTAITiT ATTTIATGTT TTIGTTGTIT 1100 
TTGITATTTT GTEATnATT AAGCACAACA CACAGAAAGC AAACTTEAAT 1150 
TAAACACACT TATTIAAAAT TEAACACACT AAQCAAGCAC AAACAATAAA " 1200 
GATAAAGAAA GCTTEATATA TTEATAGGCT TTTTTATAAT TTAACTTACA 1250 
3 5 GCTCCTTTTA AGCAAGTTCT GTGAGTTTTG CCTGTTTTTr AACCOCAAAC 1300 
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ATITCAIAGA AC1TGT1!AAG GGTTTCACTG TAATGTTCCA TAGCAAIACT 1350 
TCCTTEAQCA. TEAGGA1TGC TGGAGCTAAG TAIAGCAGCA TACTCTTTCC 1400 
CCTTCITCA.C CTGATCTTCA TTCATTTCAA AT3CT1T1CT TTTCAGCACA 1450 
GTGGAAACTT TTCClMSQC. TTCCCTGGTG TCAIACTTCT TTGGGTCGKT 1500 
5 COCGAGATX 11WJMT«3 CR3XX3X3AIA. IAIAGCCAAG fltaAOOGA 1550 
imX^TCAAA GOMtAAdi? GAAGCAATAA GAGSEAAGCT AOCTOCCAGC 1600 
ATTKD39CM G0CTCACAGA dfrTCCATCA. TCAAGAGGTA ATCCATAGGC 1650 
TTGACICAAA. GGGTGGGAAG CAAICTEAGA TTTGATAGTA TIGAGATTCT 1700 
CAGAATTCCC AGTTTCCTCA. ACAAGCCTGA O0CTGATCAA GCTATCAAGC 1750 
1 0 CTTCTGAAGG TCATGTCAGT GQCTOCAATC CTGTCTGAAG 1T1TCT1TAT 1800 
GGTAATTTTA. CCAAAAGTAA AATCGCTITG CTEAAIAACC TTCATTATGC 1850 
TCTGACGATT CTTCAGGAAT GTCAGACATC AAATAATGCT CATCTTETTG 1900 
ATCTGGTCAA GGTTTTCCAG ACAAAAAGTC TTGAAGTTGA ATGCTACCAG 1950 
ATTCTGATCT TOCTCAAACT CAAGGTCTTT GCCTTGTGTC AACAAAGCAA 2000 

1 5 CAATGCTTTC CTTAGTGAGC TIAACCAT 2028 

This comparison showed thai cDNA insert of clone pTSWVS-23 
included about 760 bp Of the 52 K protein viral component gene, the 
complete intergenlc region (492 bp), and 450 bp of the NP gene (about 
half of the NP gene). This cloned insert had its 3'-end located exactly 
20 at an EcoRI recognition site, which suggested incomplete EcoRI 

methylation during the cDNA cloning procedure. Although this clone did 
not contain the complete TSWV-BL NP gene, its sequence was of 
considerable importance since it had a 450 bp overlap with the 
sequence of the PCR-engineered NP gene (a total of 2028 bp of the 

2 5 TSWV-BL S RNA is presented in the nucleotide, sequence for TSWV). The 

sequence comparison between this TSWV-BL PCR-engineered and TSWV- 
CPNH1 NP genes revealed a total of 21 nucleotide differences (2.7%), 
eight of which encode amino acid replacements (3.1%). Since this PCR 
engineered NP gene was obtained using Taq polymerase, which is known 

3 0 to incorporate mutations, it is possible that some of these differences 

were introduced during PCR amplification. However, 15 of these 
nucleotide differences were located within the overlapping region 
between the TSWV-BL cDNA and PCR clones, and all but one of these 
nucleotide differences (position 1702 of TSWV; position 485 of TSWV- 
3 5 PCR)) are shared by both TSWV-BL S RNA derived clones. This 
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comparison clearly showed that the PCR amplification did not 
bontfibute greatly, if at all* to the difference between the nucleotide 
Sequences of these two clofied NP gene regions. The nucleotide 
difference at poSitioh 1702 resulted In the amino acid replacement of 
5 tie With Ser, and even tht§ difference could be due to the lack of 
hoiridgeneity within the tSWV-BL isolate. 

EXAMPLE V 

AgtobdcteNUM-mt&l&teti transformation: 

Leaf discs of Nicoti&na tabacum var Havana cv 423 were 
1 0 inoculated with the Agrobacterium strain LBA4404 (ClonTech) 

containing the vector pBlNl9-NP+or the control plasmid pBIN19, by 
Soaking overnight in a liquid culture of the Agobacterium, and the 
inoculated leaf discs Were incubated on non-selective MS medium for 3 
days, [see Science 227:1229 (1985)]. Transformed cells were selected 

1 5 and regenerated in MS medium containing 300 jig/ml kanamycin and 500 

p,g/fnl carbenicillin for shoot regeneration. Roots were induced after 
transfer of ptantlets to hormone-free medium. Rooted transformants 
were transferred to soil and grown under greenhouse conditions. The MS 
medium contains full strength MS salt (Sigma), 30 g/l sucrose, 1 mg/l 
20 BA and 1 ml of B5 Vitamins [1 mg/ml Nicotinic acid, 10 mg/ml Thiamine 
(HCI), 1 mg/ml Pyridoxine (HCI), 100 mg/m! Myo-lnositot], Transgenic 
plants were self-pollinated and seeds were selectively germinated on 
kanamycin medium. 

EXAMPLE VI 

2 5 Serological detection of proteins: 

Double antibody sandwich enzyme-linked immunosorbent assay 
(DAS-ELISA) was used to detect the expression of NP gene in transgenic 
plants with polyclonal antibodies against the TSWV-BL NP. Each sample 
was prepared by grinding a leaf disc (about 0.05 g) from the top second 

3 0 leaf of the plant in 3 ml of an enzyme conjugate buffer [phosphate- 

buffered saline, 0.05% Tween 20, 2% polyvinylpyrrolidone 40, and 0.2% 
ovalbumin]. For tobacco protoplasts, the cell extracts after - x 
centrifugation were directly used for the assay. A ten- and three-fold 
dilutions of the samples from both transgenic plants and tobacco 
3 5 protoplasts were made just before DAS-ELISA. 
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For Western blots, a leaf disc (about 0.05 g) Was ground In 0.25 
ml of 2X SDS/sample buffer (0.126 M Tris buffer, 20% glycerol, 2% SDS, 
2% 2-mercaptoethanol, and 0.01 mg/ml bromphenol blue). The 
homogenates were centrituged and the supernatants Were boiled before 
5 loading. Proteins (iO-20 p.1 sample/lane) were separated and blotted 
onto a rnembrane. Th8 tliembrane Was then processed following the 
manufacturer's immiihoselect kit instruction manual (Gibco BRL Life 
Technologies Inc.). Antibodies to the whole virion were preabsorbed 
with cell extracts from health tobacco plants (See Plant Disease '70:501 
1 0 (1986)1, and were used in Western blot at a concentration of 2 u.g/ml. 
Serological reactions of TSWV isolates (TSWV-BL, Arkansas, 
10W pakchoy, Begonia or Brazil) were assayed in DAS-ELISA using 
antibodies raised against TSWV-BL virion, or the NP of TSWV-BL or 
TSWV-I. 

1 5 EXAMPLE VII 

Inoculation of transgenic plants with TSWV isolates. 

Inocula were prepared by infecting Nicotiana benthamiana Domin. 
with different TSWV isolates and grinding infected leaves (0.5 g) of N. 
benthamiana plants (1 to 2 weeks after inoculation) in 15 ml. of a 

2 0 buffer (0.033 M KH2PO4, 0.067 M K2HPO4 and 0.01 M Na2S03). The 

inoculum extracts were immediately rubbed on corundum-dusted leaves 
of transgenic plants and the inoculated leaves were subsequently rinsed 
with H2O. Because TSWV is highly unstable in vitro after grinding, each 
batch of Inoculum was used to first inoculate NP(+) plants containing 
25 the NP gene; the last inoculated plants of each inoculum were always 
control NP(-) plants containing the vector sequence alone to assure that 
a particular virus inoculum was still infective at the end of inoculation. 

Data on local lesions and systemic infections were taken 7-15 
days after inoculation and expressed in the following table as the 
. 3 0 number of plants systemicaliy infected over the number of plants 

inoculated, except where Indicated. In this table, the data collected 
under "ELISA" is the data of Ro lines from which the R1 plants were 
derived; the Begonia isolate induced local lesions on the R1 plants, an d 
the resistance was expressed as the number of plants producing local 



lesions over the numbef of plants inoculated; and NT indicates that 
there Was no test. 



Reactions of Rt p\m\£ Expressing the nucieocapsid protein (NP) gene of 
tomato Spbtted Wilt VlfUS (TSWV) to Inoculation with TSWV Isolates. 
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(ROpl.) 




Reactions to TSWV' Isolates 

Arkansas low Begonia 
Pakchoy 


Brazil 




Rn line 














10 


NP(+)2 


0.015 


0/20 


• 4/25 


3/24 


29/40 


36/36 




NP(+)4 


0.386 


6/30 


21/23 


18/21 


' , 9/48 


42/42 




NP(+)9 


0.327 


0/20 


NT 


20/20 








NP(+)14 


0.040 


0/20 




9/20 


8/18 


18/18 




NP(+)21 


0.042 


0/15 


5/15 


3/15 


2/4 


6/6 


15 


NP(+)22 


0.142 


0/20 




15/20 


31/36 


36/36 




NP(+)23 


0.317 


0/20 




16/20 








NP(-) 




42/42 


24/24 


62/62 


66/66 


54/54 



As described above, the isolation of the TSWV-BL NP gene, which 
resides in the S RNA component of TSWV, was approached using two 

20 strategies. The cDNA cloning strategy yielded several clones containing 
cDNA inserts derived from TSWV-BL S RNA, as identified by 
hybridization against an oligomer probe complementary to the TSWV- 
CPNH1 S RNA. Clone pTSWVS-23 contained the longest insert, which 
mapped at about 1.7 kb In length . The second strategy utilized the 

25 published sequence of TSWV-CPNH1 S RNA and PCR to amplify and 

engineer the NP gene for expression directly from total TSWV-BL RNA. 
Oligomer primers JLS90-46 and -47 were synthesized, with JLS90-46 
being complementary to the S RNA in the 5'-coding region of the NP gene 
(positions 2051-2073 of the TSWV-CPNH1) while JLS90-47 being of 

3 0 the 3'-noncoding region of the NP gene (positions 1218 to 1237 of the 
TSWV-CPNH1) . Both of the primers contain the recognition site for the 
restriction enzyme Nco\ for subsequent cloning, and the primer JLS90- 
46 has a plant consensus translation initiation codon sequence 
(AAXXATGG), Which Upon amplification was expected to fuse the 

3 5 translation initiation codon to the third codon (GTT) of the NP gene . 
Fusion of the translation initiation codon to the third codon of the 
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TSWV-BL NP gene Was dohe to preserve the Nco\ recognition site while 
not incorporating any ahiino acid codons. Thus, expression of the PCR- 
engineered TSWV NP gene Would yield a TSWV-BL NP that was two 
amino acids (Ser-Ly6) shorter at the N-terminus than the native NP. 
5 This specifically^arrtplified DNA fragment, of about 850 bp, was 

digested with Ncol and cloned into the plant expression Vector pB1 525. 
The orientation of the TSWV-BL NP gene with respect to the CaMV 35S 
promoter Was determined by restriction enzyme site mapping (EcoRI, 
Hindllt, Aval and AiwNl). Several clones Were isolated that contain the 
10 insert in the proper orientation (pB1525-NP+) arid .others that contain 
the insert in the opposite orientation (pB1525-NP-). This restriction 
enzyme site mapping data also showed that the inserts of clones 
PB1525-NP+ contained restriction enzyme sites that were identical to 
those found in the TSWV-CPNH1 NP gene. The expression of TSWV-BL 
15 NP gene was thus controlled by a double CaMV 35S promoter fused to 
the 5'-untranslated leader sequence of alfalfa mosaic virus (ALMV) of 
the expression vector pB1525. Expression vectors that utilize the 
stacked double CaMV 35S promoter elements yield higher levels of 
mRNA transcription than similar vectors that utilize a single 35S 
20 promoter element. 

Three pB1525-NP+clones were transiently expressed in tobacco 
protoplasts to confirm that the amplified DNA fragment encoded the NP. 
To achieve this, the clones were transferred into tobacco protoplasts 
by the PEG method, and after two days of incubation the expressed NP 

2 5 was detected by DAS-ELISA using antibodies against the Whole TSWV- 

BL virion . High levels of NP were produced in tobacco protoplasts 
harboring the NP gene in plasmid pB1525-NP+; while no NP was 
detected in tobacco protoplasts transformed with the antisense NP 
sequence (pB1525-NP _ ). 

3 0 As described previously, the PCR-engineered insert of clone 

pB!525-NP+ and teh cDNA insert of the clone pTSWV-23 Were subjected 
to double stranded sequencing. The sequence analysis of the cDNA and 
the PCR clones revealed inserts of 1.71 kb and 865 bp, respectively 
which, when compared With the sequence TSWV-CPNH1 S RNA, shows 
3 5 that cDNA insert of clone pTSWV-23 includes about 760 bp of the 52 K 
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protein viral component gene, the complete intergenic region (492 bp), 
and 450 bp of the NP gefte (about one-half of the gene). This cloned 
insert has its 3'-end located exactly at an EcoRI recognition site 
suggesting! incomplete £coRI methyiation during the 1 cDNA cloning 
5 procedure. Although this clone does not contain the complete TSWV-BL 
NP gene, Its sequence Is of considerable importance since it has a 450 
bp overlap With the sequence of the PCR-engineered NP gene. The 
sequence comparison between this TSWV-BL PCR-engineered and.TSWV- 
CPNH1 NP genes reveals a total of 21 nucleotide differences (2.7%), 
1 0 eight of which encode amino acid replacements (3'.t%). Since this PCR- 
engineered NP gene was obtained using Tag polymerase, which is known 
to incorporate mutations, it is possible that some of these differences 
were introduced during PCR amplification. However, 15 of these 
nucleotide differences are located within the overlapping region 

1 5 between the TSWV-BL cDNA and PCR clones, and all but one of these 

differences (position 1702) are present in both TSWV-BL S RNA derived 
clones. This comparison clearly shows that the PCR amplification did 
not contribute greatly, if at all, to the difference between the 
nucleotide sequences of these two NP genes. The nucleotide difference 
20 at position 1702 results in the amino acid replacement of lie with Ser, 
and even this difference could be due to the lack of homogeneity within 
the tSWV-BL isolate. 

The possibility that the nucleotide differences can be attributed 

to divergence among the TSWV isolates is also supported by 

2 5 comparisons with other sequenced regions among TSWV-CPNH1, TSWV- 

L3, and TSWV-Bl S RNAs. These comparisons are tabulated below: 

Percent nucleotide and amino acid sequence differences for the comparison of TSWV S 
RNA component from isolates CPNH1 , L3 and BL a 

52 K Protein Gene intergenic NPQene 

3 0 Comparison Nucleotide Amino Acid Nucleotide Nucleotide Amino Acid 

CPNH1/L3 68/1396 b (4.9) c 49/464(10.6) 46/511(9.0) 24/777(3.1)- 4/258(1.6) 

CPNW/BL 21/758(4.1) 23/251(9.2) 26/496(5.2) 19/765(2.5) 8/255(3.1) 

L3/BL 38/765(5.0) 20/254(7.9) 38/498(7.6) 19/767(2.5) 4/255(1.6) 
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a Comparisons are made Using, the sequence information available from the 
particular component region of tSWV-BL The comparison for the TSWV-BL NP 
gene includes the combined sequence information from the cDNA done, pTSWVS-23 
and PCR-engineered Insert. 
5 b Comparison numbers are total differences (nucleotides or amino acids) 
divided by total number 6f positions (nucleotides or amino acids) compared. For 
both nucleotide and amlttd field calculation gaps, regardless of length, were counted as 
one mismatch. 

c Numbers in parentheses are percentages. 

1 0 The nucleotide Sequence of the NP genes from the CPNH1 and L3 

isolates differ from each other by 3.1% and from the BL isolate by ' 
nearly a similar degree (2.5%). However, the NP amino acid sequences 
between CPNH1 and BL Isolates differ by a considerably larger amount 
than they differ between the L3 and BL or CPNH1 and L3 isolates. The 

1 5 results tabulated above also reveal that the NP gene region of these 
TSWV Isolates is subject to a higher degree of selective pressure than 
the 52 K protein as the differences among the amino acid sequences of 
the 52 K protein range between 7.9 to 10.6%, more than twice that 
found for the amino acid sequence of the NPs. Nucleotide sequence 

20 divergence is highest among the intergenlc regions, Indicating that this 
region is subject to less selective pressure than either genetic region. 

The presence of NP gene sequences In transgenic plants was first 
confirmed by PCR analysis. A NP DNA fragment of about 800 bp was 
specifically amplified from the total DNAs of transgenic NP(+) plants 

25 using the primers homologous to sequences flanking the NP gene, 
whereas no corresponding fragment was detected in control NP(-) 
plants. Expression of the NP gene was assayed in each Ro transgenic 
plant by DAS-ELISA, and the results are presented in the following 
table: 
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Reactions of RO transgenic plants expressing the nucleocapsid protein (NP) gene of 
tomato spotted Wilt virus (TSWV) to inoculation with TSWV-BL isolate 





plant age 


Ro clone 


ELISA a 


Lesions/leaP 


NP(4):NP( 


5 


7-8 leaves: 


NP(+)1 


0.374 


7 (199) 


1:28 






NP(+)2 


0,015 


0 (199) 


0:199 






NP(+)3 


0.407 


23 (102) 


1 :4 






NP(+)4 


0.386 


2 (102) 


1 :51 






NP(+)5 


0.023 


0 (124) 


0:124 


1 0 




NP(+)6 


0.197 


35 (325) . 


1:9 






NP(+)7 


0.124 


1 (325) 


1:325 




9-10 leaves: 


Kip/ +\a 


0 344 


36 (3$) 


1 :1 






NP(+)9 


0.327 


2 (20) 


1:10 


15 




NP(+)10 


0.406 


34 (33) 


1:1 






NP(+)11 


0.156 


5 (20) 


1:4 






NP(+)12 


0.133 


9 (57) 


1:6 






NP(+)13 


0.144 


2 (7) 


1:4 






NP(+)14 


0.040 


0 (19) 


0:19 
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NP(+)16 


0.053 


0 (10) 


0:10 




5-6 leaves: 




n A.A7 


CXJO \ I 1 / ) 


2*1 

Cm • I 






NP(+)21 


0.042 


0 (117) 


0:117 






NP(+)22 


0.142 


0 (208) 


0:208 


25 




NP(+)23 


0.317 


223 (208) 


1:1 






NP(+)24 


0.051 


0 (35) 


0:35 






NP(+)25 


0.286 


13 (35) 


1:3 






NP( + )26 


0.037 


0 (22) 


0:22 






NP(+)27 


0.425 


305 (22) 


14:1 



3 0 production of the NP in transgenic plants was assayed by double antibody 

sandwich enzyme-linked immunosorbent assay (DAS-ELISA); concentration of 
antibodies against viron tor coating: 1 jig/ml; dilution of conjugate to the NP of 
TSWV-BL: 1:250; results taken 150 min. after adding substrate; readings at 405 
nm. 

3 5 tyocal lesions that developed oh inoculated leaves were counted 7 days after 
modulation. Data represent the average of three Inoculated leaves. Data In 
parehtheses are the number of lesions produced from control NP(-) plants 
inoculated with the same inoculum. 

c the ratio of local lesions that developed on NP(+) plants transformed with pBIN19- 
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NP + versus local lesions that developed on the control NP(-) plant when Inoculated 
with the same inoculum. 

Of the 23 NP(+) clones, 10 produced high levels of NP, 5 
accumulated intermediate levels of NP, and the remaining 8 produced 

5 low levels of NP. The site of the NP expressed i«- transgenic plants Was 
analyzed using Western blot. Many polypeptides from tobacco extracts 
were reactive to the antibodies against the whole viron even though the 
antibodies were pre-absorbed with extracts from healthy tobacco 
plants. Of those, only one band was unique to the pattern of 

1 0 polypeptides from tobacco plants transformed with' the NP gene. This 
polypeptide was estimated to be around 29 kDa, which is hear the 
expected size of the native NP. No antibody reactive-protein band of 
similar size was found in extracts from transgenic plants containing 
the vector pBIN19. 

1 5 Inoculation of tobacco leaves with TSWV-BL isolate could result 

in either systemic infection or necrotic local lesions, depending upon 
weather conditions and physiological stages of plants. When Rq plants 
were tested with TSWV-BL for viral resistance, TSWV-BL induced 
typical necrotic leSidhs oft the inoculated leaves of control NP(-) plants 

20 6-8 days after inoculation. However, transgenic NP(+) plants showed a 
spectrum of resistance to the virus when compared to control NP(-) 
plants. Eleven of the 23 NP(+) plants did not develop any local lesion or 
the number of lesions that developed was at least 20-fold less than 
that on the corresponding inoculated NP(-) plants. Three NP(+) plants 

25 had intermediate reactions (5- to 19-fold less lesions than controls) 
while the remaining 9 plants had low or no resistance. None of the 
inoculated NP(+) or NP(-) plants showed systemic infection, 
symptomless Rrj plants were monitored until the end of their life cycle, 
and no symptom Was observed throughout their life cycles. The 
3 0 Inoculated leaves of the Symptomless NP(+) plants were checked for the 
presehce of the virus on the leaves of C. quinoa plants. No virus was 
recovered from TSWV-BL-challenged leaves of highly resistant NP(+) 
plants, suggesting that the virus cold not replicate or spread in these 
NP(+) plants. 
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Leaf discs from selected Rq plants were subctoned, and the 
regenerated plantlets were challenged by the virus. All subcloned Ro 
plants displayed levels of resistahce similar to their corresponding 
original Ro plants. 

5 Since TSWV is widespread and many biologically distant strains 

exist* the effectiveness of the transgenic plants to resist infections by 
different TSWV isolates were also tested. Five TSWV isolates Were 
chosen in this study to challenge R1 plants germinated on kanamycin- 
containing medium: TSWV-BL, Arkansas, 10W pakchoy, Begonia and 
1 0 Brazil. The first three isolates were reactive to the antibodies against 
the Whole virion and the NP of TSWV-BL (the common TSWV 
Serogroup) (see figure 5). Begonia isolate reacted strongly to the 
antibodies against the NP of TSWV-I (the "I" serogroup) but not to those 
raised against the TSWV-BL NP, and therefore belonged to the T 
1 5 serogroup. No detectable reaction of Brazil isolate was found to the 
antibodies against either the NP of the TSWV-BL or the TSWV-i 
serogroup, and it was weakly reactive to the antibodies against the 
Whole viron of TSWV-BL. Moreover, this isolate caused systemic 
mottle and crinkle on the leaves of infected tobacco and N. benthamiana, 
20 but did not Infect squash or cucumbers indicating that it is a distinct 
isolate from the cucurbit Isolate. These results indicate that this 
isolate may be considered to be a third serogroup. 

Seedlings derived from seven Ro lines were germinated on 
kanamycin medium and inoculated with the above TSWV isolates. 
25 Infectivity data were recorded daily starting seven days after 

inoculation. Plants inoculated with TSWV-BL, Arkansas, 10W pakchoy 
or Brazil isolates were rated susceptible if virus symptoms were 
observed on uninoculated leaves. Plants inoculated with the Begonia 
isolate were rated susceptible if local lesions were observed on 
3 0 inoculated leaves, since this isolate does not cause systemic infection 
in tobacco. All inoculated control NP(-) T*1 plants were susceptible to 
infection by these five isolates. They were systemically infected 12 
days after inoculation except that transgenic Ri plants inoculated with 
Begonia produced only local lesions on the inoculated leaves. However, 
3 5 almost all NP(4) Ri plants were highly resistant to the homologous 
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isolate TSWV-BL, while much lower percentages of NP(+) Ri plants 
were resistant to heterologous Isolates Arkansas, 10W pakchoy and 
Begonia. On the other hand, all NP(+) Ri plants from the seven 
transgenic lines Were susceptible to the Brazil Isolate* even though a 
5 slight delay (1 to 2 days) irt symptom expression was observed in some 
of the high NP-expressittp; NP(+) R1 plants from lirte NP(+)4. 

Resistant Ri plants remained symptomless throughout their life 
cycles. The inoculated leaves of seventeen symptom less NP(+) plants 
were checked tor the presence of the virus by back inoculation on 

1 0 leaves of Chetiopodium quinoa plants. No virus was recovered from the 

inoculated leaves of symptomless NP(+) plants, suggesting that the 
virus could not replicate or spread in these NP(+) plants. 

The relationship between the level of NP accumulation in 
transgenic plants and the degree of resistance to heterologous TSWV 
1 5 isolates was also studied. Analysis of the data described above 

suggested that R1 plants derived from Rrj lines With low levels of NP 
offered the best resistance to the heterologous isolates of the "L" 
serogroup (Arkansas and 10W pakchoy) while Ri from a Ro line with 
high level of NP were resistant to the Begonia isolate, which belongs to 

2 0 the "I* serogroup. For example, an average 76% of inoculated Ri plants 

from low NP expressing lines NP(+) 2, 14, and 21 were resistant to 
infections by the Arkansas and 10W pakchoy isolates, While resistance 
to these isolates was observed in only 11% of similarly inoculated 
plants from high NP expressing lines NP(+)4, 9, and 23. On the other 

2 5 hand, the Begonia isolate infected 79% of Ri plants from the low NP 

expressing line NP(+)2, 14, and 21 but only 19% from high NP expressing 
line NP(+)4. 

Therefore, it was concluded that the transgenic Ri plants 
expressing low levels of the NP gene were highly resistant to infection 

3 0 with the isolate 10W pakchoy (the "L" serogroup), but not to Begonia 

isolate (the "I" serogroup). In contrast, the highly NP-expressing Ri 
plants were very resistant to infection by Begonia isolate but not to 
infection by the isolate from 10W pakchoy. 

Thus, it was of interest to accurately quantltate the relation of 
3 5 NP expression in individual plants with resistance to the heterologous 
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isolates. In a number of inoculation experiments reported herein, leaf 
samples of transgenic plants were taken before inoculating with the 
Arkansas and 10W pakchoy isolates. Samples were also taken from 
non-ihoculated leaves of plants Inoculated With the Begonia isolate 
5 after observations of the apparent relation between NP expression 
levels and resistance were made, the tatter method of sampling could 
be done without Interference frorh infection by the Begonia isolate 
because this isolate tides hot cause systemic infection in tobacco nor 
reacts with antibodies to the TSWV-BL Np. All samples were assayed 
1 0 for relative NP levels by DAS-ELISA using antibodies raised to isolated 
NP of fSWV-BL Figures 5 and 6 show the relation between NP levels in 
transgenic Ri plants (irrespective of the Rrj lines they came from) and 
their resistance to the Arkahsas and 10W pakchoy isolates or to the 
Begonia isolate. Nearly all transgenic P»1 plants With very low or 

1 5 undetectable ELISA reactions (0-0.05 OD405nm) were resistant to 

infections by the Arkansas and 10W pakchoy isolates (the "L" serogroup) 
but susceptible to the Begonia isolate (the "I" serogroup). In contrast, 
almost all Ri plants that gave high ELISA reactions (0.4-1.0 OD405nm) 
were resistant to the Begonia isolate but susceptible to the Arkansas 
20 and 10W pakchoy Isolates. 

The double-stranded (ds) RNA was isolated from the N. 
benthamlana plants infected with TSWV-B using a combination of 
methods [See Acta Horticulturae 186:51 (1986), and Can. Plant Dis Surv 
68:93(1988)) which have been successfully used for isolation of dsRNA 

2 5 from tissue infected With grapevine leafroll virus. The dsRNA was 

chosen for the cDNA synthesis since isolation of the virus particle from 
this isolate has not been possible {see Plant Disease 74:154 (1990)]. In 
order to make a cDNA library specific to the S RNA of TSWV-B, the 
double stranded S RNA was gel-purified, denatured by methyl-mercury 

3 0 treatment, and subjected to cDNA synthesis procedure provided by 

Promega using random primers. The synthesized cDNA fragments were 
cloned via an EcoRI adaptor into the EcoRI digested \ ZAPII 

(Strategene), and positive clones were identified by colony 
hybridization using the cDNA probes prepared by reverse transcription 
3 5 of gel-purified S RNA. Dozens of positive clones were analyzed on 
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agarose gels and only three overlapping clones containing the largest 
inserts (L1 t L22 aild L30) Were selected (see figure 3), covering nearly 
entire TSWV-B S RNA. 

The nucleotide Sequences of the inserts In clones L1, L22 and L30 
5 were determined froitt both DNA strands, first by the universal and 
reverse primers and th§fi by th§ internal primers designed for 
sequencing the S RNA 6r TSWV-B. Sequencing was done using the 
Sanger didedxyribohudeotide method, T7 polymerase (U.S. 
Biochemlcals, Sequeha§§ TM), and the double-stranded sequencing 

10 procedure described by Siemieniak [see Analyt.'Biochem. 192:441 
(1991)]. The sequence analyses of these clones revealed inserts of 
1.994 kb, 2.368 kb and 1.576 kb, respectively, and these sequences 
represented 93% of the S RNA genome (see figure 3). The assembled 
sequence was analyzed by comparisons with sequences of TSWV 

15 isolates CONH1, L3, I, and BL using computer programs available from 
the Genetics Computer Group (GCG, Madison, Wl). 

Computer analysis showed that the assembled sequence of 2.842 
kb covered the complete 52 K nonstructural protein gene, the complete 
intergenic region (629 bp), and 737 bp of the NP gene (only 39 N- 

20 terminal nucleotides of the N gene were not represented). In order to 
obtain this missing region of the N gene, a primer 
TTCTGGTCTTCTTCAAACTCA, Identical to a sequence 62 nucleotides 
from the initiation codon of the N gene, was end-labeled with 
polynucleotide kinase to screen the cDNA library described above. Five 

25 putative clones were obtained. Sequence analysis of the five clones 

showed that only clones S6 and S7 contain these 39 missing nucleotides 
of the N gene. The latter clone also included the extreme 3' end of the S 
RNA 

The 5' extreme end of the S RNA was obtained using the 5' RACE 
3 0 System (GIBCO). Both SsRNA of TSWV-B and total RNAs isolated from 
tobacco plants infected with TSWV-B were used to synthesize first 
strand cDNA with an oligonucleotide (5-CTGTAGCCATGAGCAAAG) 
complementary to the nucleotide positons 746-763 of te TSWV-B S 
RNA. The 3'-end of the first strand cDNA was tailed with dCTP using 
3 5 terminal deoxynucleotidyl transferase. Tailed cDNA was then amplified 
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by PCR using an anshor primer that anneals to the homopolymeric tail, 
and art oligonucleotide (5-TtATATCTTCTTCTTGGA) that annuals to the 
nucleotide positions 512-529 of the TSWV-B S RNA. The PCR- 
ampltifi&d fragement Was gel-purified and directly cloned into the T- 

5 vector pi7Blue (Novagen) tor sequence analysis. Eight independent 
clones were sequenced with an oligomer primer (5- 
GlTCTGAGATTTGCTAGT) close to the 5' region ot the S RNA (nucleotide 
positions 40-57 of the TSWV-B S RNA). Six of the resulting clones 
contained the 5' extreme end ot the S RNA and the 5-termihal 

1 0 nucloetlde sequence from these clones was identical.. Thus, the 

complete nucleotide sequence of the TSWB-B S RNA Is 3049 nucleotides 
In length. 

Thus these two clones together with the three clones previously 
sequenced (L1, L22, L30, S6 and S7) covered a total of 3032 nucleotides 

1 5 depicted above. Comparisons with the terminal sequences of TSWV- 
CPNH1 and TSWV-I revealed that although the extreme 5' end ot 18 
nucleotides was not represented in the assembled sequence, the 
extreme 3-termlnus ot the TSWV-B S RNA is Identical to the extreme 
3' end of the TSWV-I S RNA and is only one out of fifteen nucleotides 

20 differefit from the extreme 3' end of TSWV-CPNH1. The conservation of 
the terminal sequence among TSWV isolates is consistent with 
observations of the other members of Bunyaviridae genera, and supports 
the hypothesis that the terminal sequences might form stable base- 
paired structure, which could be involved in its replication and 

25 encapsulation. 

The complete nucleotide sequence of the S RNA genome of TSWV- 
B (the Brazilian isolate discussed above) according to the present 
invention is: 

AGAGCAA1TG GGTCATTTTT TATTCEAAAT CGAAOCTCAA CTAGCAAATC 50 
3 0 TCAGAACTGT AATAAGCACA AGAGCACAAG AGOCAC AATG TCATCAGGTG 100 



3 5 ATTTTATCTrC AGAACXTTTCA TATCCCAGTG TTTGATGATA TO5ATTTCAG 350 




150 



200 



250 



300 
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cKrrjwrter. aatc^ttctt tcttggcaat ttctgtttgt TceAAC&caG 400 

A TGGAGTGAAG GATCAGGGTC ATCITAAAGT TCTTTCTCTT 450 

: ATHCTrim aqctgtcatg agcaggtcag agattgctag 500 

AG AAGAWAAT TCCTGATGAC AAAmTKEAT 550 

fr .cftraxxraro ttaaagaaca mfemcRAA 600 

M1t&.(^<^tt&C^^ AAAGTGAATB *IX?ITPCTOe 650 

AAT rtfiMTORCTr GGCTSTATAG TTTCAAAOCA AATITCAACC 700 

Mr WfrAGAGA Ac^rsiaAArr ctcttggagt caaatctttg 750 

(Mmoph gmm mctcttmCt cmggro& soo' 

AQCfltefcACT GAT^mTATT TTAAGTPGAG (CT TTOOS AGAATTOCAA 850 
AftGrtfiTGAA CyaAATAGQG ATACAGAAGC TfTirAAGTT TGCAGGAGAC 900 
GAAAGGGGm AAAGTTK3A TCTGTCTATT GCATCTATOn CAAATCACAA 950 
CAGflgflGGAA ACflGCTTEAA ATGTCACTGT TATATGTAGA CATCAGCTTC 1000 
CAATQQCTAA GTOCAAAGCT CCTTTTTGAAT TA TTAATGAT TTTCTCOGAT 1050 
CTGAAAGAGC CTTACAAGAC TG TGTATGAT OCTITATATC CTCAAAGGAT 1100 
TGTTCAfrGCT TTGCTTGAGA CITACACTTC CTTTGCACAA GTTCTCTGCA 1150 
ACAAQCTGCA AGAAGATGTO ATCATATATA CTATAAACAG OCCTGAACTA 1200 
ACXXXftGCTA AGCTCGATCT AGGTGAAAGA ACCITGAACT ACAGTGAAGA 1250 
TGCTTOSAAG AA GRAGTATT TTGTTTCAAA AACACTCGAA TGCTTGCCAG 1300 
TAAATCTQCA GACTATSTCT TATTTCGATA GTATXTAGAT TCOTTCATGG 1350 
AAGATAGACT TTGCTAGAGG AGAGATCAGA ATCTCCCCTC AATCTACTCC 1400 
TATTGCAAGA TCTTTGCTCA AGCTGGATTT GAGTAAGATC AAGGAAAAGA 1450 
AGTCCTTGAC TTGGGAAACA TCTAGCTATG ATCTAGAATA AAAGTGGCTC 1500 
ATACTA.CTCT AAGTAGTATT TGTCAACTTG CTEATCCTTT ATGTTCTTTA 1550 
TTTCTTTIAA A3CTAAAGTA AGTIAGATTC AAGTAGTTTA GTATGCTATA 1600 
GCAITATEAC AAAAAATACA AAAAAATACA AAAAAATACA AAAAATATAA 1650 
AAAACCCAAA AAGATCCCAA AAGGGACGAT TIGGTTGATT 1ACTCTGTTT 1700 
TAGGCTTATC TAAGCTGCTT TTGTTTGAGC AAAATAACAT TGTAACATGC 1750 
AATAACTGGA ATITAAAGTC CTAAAAGAAG TITCAAAGGA CAGCTEAGCC 1800 
AAAATTGGTT TTTG1TJ.TTG TTITITrGTT TITTGTTrrT TTGTTTTATT 1850 
TTTAl'ITTTA GTITATTTTT TGTTITTGIT ATTITTA1TT TEATITEATT 1900 
TTLTiTIATT TTATTTATAT A3ATATCAAA OOATCCAC ACAAATAATT 1950 
TTAATTTCAA ACATTCTACT GAITIAACAC ACTTAGCCTG ACTTTATCAC ^2000 
ACTEAACACG CTEAGTTAGG CTITAACACA CTGAACTGAA TTAAAAGACA 2050 
CTTAGTATTA TGCATCTCIT AATTAACACA CTITAATAAT ATGCATCTCT 2100 
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GAATCAQOCT TAAAGAAGCT TTTATGCAAC ACCAGCAATC TTGGQCTCTT 2150 

TCTTAACTCC AAACATTTCA TAGAATTTGT CAAGATTATC ACTGTAATAG 2200 

TOCATAGCAA TGCTTCOCTT AGCATTGGGA TTGCAAGAAC TAAGTATCIT 2250 

GQCATATTCT TTCXXTTTGT TTATCTGTGC ATCATOCATT GTAAATCCTT 2300 

5 TGCTTITAAG CACTGTGCAA ACCTTCOOCA GAGCTTCCTT AGTGTTGTAC 2350 

TTAGTTUtfiT CAATOCCTAA CTXTTCTAC TTTGCATCIT GATATATQGC 2400 

AAGAACAACA CTGATCATCT CGAAGCTGTC AACAGAAGGA ATGAGAGGGA 2450 

TACTACCTCC AAGCA1TAIA GCAAGTCTCA CAGATTTTGC ATCTGCCAGA 2500 

GGCAQCOOGT AAGCTTGGAC CAAAGGGTGG GAGGCAAITT TTGCTTTSVT 2550 

1 0 AATAQCAAGA TTCTCAITGT TTGCAGTCTC TTCTATGAGC TTCACTCTTA 2600 

TCATQCTATC AAGCCTOCTG AAAGTCATAT CCITAGCTCC AACTCTTTCA 2650 

GAAlTiTlCT TTATCGTGAC CTEACCAAAA GTAAAATCAC TTTGGTTCAC 2700 

AACTTKATA ATGCCTTGGC GAITCTTCAA GAAAGICAAA CATGAAGTGA 2750 

TACTCAITTT CTTAATCAGG TCAAGAITTT CCTGACAGAA AGTCTTAAAG 2800 

1 5 TTGAATQCGA CCTGGTTCTG GILTiLTICA AACTCAAGAT CTGCAGATTG 2850 

AGTTAAAAGA GAGACAATGT ITlUi'lTlGT GAGCTTGACC TTAGACATGG 2900 

TGGCAGTnA GATCTAGACC TTTCTCGAGA GATAAGATTC AAGGTGAGAA 2950 

AGTGCAACAC TGIAGAQCGC GGTCGTTACT TATGCTGTTA ATGTGATGAT 3000 

TTGTA1TGCT GAGTAITAGG TITTTGAATA AAATTGACAC AATTGCTCT 3049 

20 The deduced amlho acid sequences of the nonstructural (single 

underlined above) and nucleocapsid proteins according to the present 
invention are: 



Met 


Ser Ser Gly Val Tyr 


Glu Ser 


He 


He Gin Thr 


Lys Ala Ser 




5 






10 




15 


Val 


Trp Gly Ser Thr Ala 


Ser Gly 


Lys 


Ser He Val 


Asp 


Ser Tyr 




20 






25 




30 


Trp 


He Tyr Glu Phe Pro 


Thr Gly 


Ser 


Pro Leu Val 


Gin 


Thr Gin 
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40 




45 


Leu 


Tyr Ser Asp Ser Arg 


Ser Lys 


Ser 


Ser Phe Gly 


Tyr 


Thr Ser 




50 






55 




60 


Lys 


He Gly Asp He Pro 


Ala Val 


Glu 


Glu Glu He 


Leu 


Ser Gin 




65 






70 




75 


Asn 


Val His He Pro Val 


Phe Asp 


Asp, 


He Asp Phe 


Ser 


He Asn 




80 






85 




90 


He 


Asn Asp ser Phe Leu 


Ala He 


Ser 


Val Cys Ser 


Asn 


Thr. Val 




95 






100 




105 


Asn 


Thr Asn Gly Val Lys 


His Gin 


Gly 


His Leu Lys 


Val 


Leu Ser 




110 






115 




120 
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Leu Ma Gin Leu His Pro Phe Glu Pro Val Met Ser Arg Ser Glu 
125 130 135 

He Ala Ser Arg Phe Arg Leu Gin Glu Glu Asp He He Pro Asp 
140 145 150 

Asp Lys Tyr He Ser Ala Ala Asn Lys Gly Ser Leu Ser Cys Val 
155 160 165 

Lys Glu His Thr Tyr Lys Val Glu Met Ser His Asn Gin Ala Leu 
170 175 180 

Gly Lys Val Asn Val Leu Ser Pro Asn Arg Asn Val His Glu Trp 
185 190 195 

Leu Tyr Ser Phe Lys Pro Asn Glu Asn Gin He Glu Ser Asn Aso. ' 

200 205 ■ 210 

Am Thr Val Asn Ser Leu Ala Val Lys Ser Leu Letf'Met Ala Thr 
215 220 225 

Glu Asn Asn He Met Pro Asn Ser Gin Ala Phe Val Lys Ala Ser 
230 235 240 

Thr Asp Ser His Phe Lys Leu Ser Leu Gin Leu Arg He Pro Lys 
245 250 255 

Val Leu Lys Gin He Ala He Gin Lys Leu Phe Lys Phe Ala Gly 
260 265 270 

Asp Glu Thr Gly Lys Ser Phe Tyr Leu Ser lie Ala Cys He Pro 
275 280 285 

Asn His Asn Ser Val Glu Thr Ala Leu Asn Val Thr Val He Cys 
290 295 300 

Arg His Gin Leu Pro He Pro Lys Ser Lys Ala Pro Phe Glu Leu 
305 310 315 

Ser Met He Phe Ser Asp Leu Lys Glu Pro Tyr Asn Thr Val His 
320 325 330 

Asp Pro Ser Tyr Pro Gin Arg He Val His Ala Leu Leu Glu Thr 
335 340 345 

His Thr Ser Phe Ala Gin Val Leu Cys Asn Lys Leu Gin Glu Asp 
350 355 360 

Val He He Tyr Thr He Asn Ser Pro Glu Leu Thr Pro Ala Lys 
365 370 375 

Leu Asp Leu Gly Glu Arg Thr Leu Asn Tyr Ser Glu Asp Ala Ser 
380 385 390 

Lys Lys Lys Tyr Phe Leu Ser Lys -Thr Leu Glu Cys Leu Pro Val 
395 400 405 

Asn Val Gin Thr Mat Ser Tyr Leu Asp Ser He Gin He Pro Ser 
410 415 420 

Trp Lys He Asp Phe Ala Arg Gly Glu He Arg He Ser Pro^Gln 
425 430 435 

Ser Thr Pro He Ala Arg Ser Leu Leu Lys Leu Asp Leu Ser Lys 
440 445 450 
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He Lys Glu Lys Lys Ser Leu Thr Trp Glu Thr Ser Ser Tyr Asp 
455 460 465 

Leu Glu; 
and 

5 Met Ser Lys Val Lys Leu Thr Lys Glu Asn He Val Ser Leu Leu 

5 10 15 

Thr Gin Ser Ala Asp Val Glu Phe Glu Glu Asp Gin Asn Gin Val 

20 25 30 

Ala Phe Asn Phe Lys Thr Phe Cys Gin Glu Asn Leu Asp Leu He- 
10 35 40 45 

Lys Lys Met Ser He Thr Ser Cys Leu Thr Phe Leu Lys Asn Arg 
. 50 55 60 

Gin Gly He Met Lys Val Val Asn Gin Ser Asp Phe Thr Phe Gly 

65 70 75 

1 5 Lys Val Thr He Lys Lys Asn Ser Glu Arg Val Gly Ala Lys Asp 

80 85 90 

Met Thr Phe Arg Arg Leu Asp Ser Met He Arg Val Lys Leu He 

95 100 105 

Glu Glu Thr Ala Asn Asn Glu Asn Leu Ala He He Lys Ala Lys 
20 110 115 120 

He Ala Ser His Pro Leu Val Gin Ala Tyr Gly Leu Pro Leu Ala 
125 130 135 

Asp Ala Lys Ser Val Arg Leu Ala He Met Leu Gly Gly Ser He 
140 145 150 

2 5 Pro Leu He Ala Ser Val Asp Ser Phe Glu Met He Ser Val Val 

155 160 165 

Leu Ala He Tyr Gin Asp Ala Lys Tyr Lys Glu Leu Gly He Glu 

170 175 180 

Pro Thr Lys Tyr Asn Thr Lys Glu Ala Leu Gly Lys Val Cys Thr 

3 0 185 190 195 

Val Leu Lys Ser Lys Gly Phe Thr Met Asp Asp Ala Gin He Asn 
200 205 210 

Lys Gly Lys Glu Tyr Ala Lys He Leu Ser Ser Cys Asn Pro Asn 
215 220 225 

3 5 Ala Lys Gly Ser He Ala Met Asp Tyr Tyr Ser Asp Asn Leu Asp 

230 235 240 

Lys Phe Tyr Glu Met Phe Gly Val Lys Lys Glu Ala Lys He Ala 
245 * 250 255 

Gly Val Ala 

40 As the nucleocapsid protein gene depicted above is on the viral 

complementary strand, the nucleocapsid protein gene of TSWV-B is: 

ATG TCT AAG GTC AAG CTC ACA. AAA. GAA AAC ATT GTC TCT CTT TTA 45 
ACT CAA TCT GCA GAT GTT GAG TTT GAA GAA GAC CAG AAC CAG GTC 90 
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GCA TTC AAC ITT AAG ACT TTC TGT CAG GAA AAT CTT GAC CTG ATT 135 
AAG AAA. ATG ACT ATC ACT TCA TGT TTG ACT TTC TTG AAG AAT CGC 180 
CAA GGC ATT ATG AAA GTT GTG AAC CAA ACT GAT TTT ACT TTT GGT 225 
AAG GTC ACG ATA AAG AAA AAT TCT GAA AGA GTT GGA GCT fflG GAT 270 
5 ATG ACT TTC AGG AGG CTT GAT AGC ATG ATA AGA GTG AAG CIC ATA 315 
GAA GAG ACT GCA AAC AAT GAG AAT CTT GCT ATT ATC AAA GCA AAA 360 
ATT GCC TGC CAC OCT TTG GTC CAA GCT TAG GGG CTG CCT CTG GCA 405 
GAT GCA AAA TCT GTG AGA CTT GCT ATA ATG CTT GGA GCT ACT ATC 450 
CCT CTC ATT GCT TCT GTT GAC AGC TTC GAG ATG ATC ACT CTT GTT 495 

10 CTT GCC ATA TAT CAA GAT GCA AAG TAC AAG GAG TTA GGG ATT GAA 540 
CCA ACT AAG TAG AAC ACT AAG GAA GCT CTG GGG AAG GTT TGC ACA 585 
GTG CTT AAA AGC AAA GGA TTT ACA ATG GAT GAT GCA CAG ATA AAC 630 
AAA" GGG AAA GAA TAT GCC AAG ATA CTT ACT TCT TGC AAT CCC AAT 675 
GCT AAG GGA AGC ATT GCT ATG GAC TAT TAC ACT GAT AAT CTT GAC 720 

1 5 AAA TTC TAT GAA ATG TTT GGA GTT AAG AAA GAG GCC AAG ATT GCT 765 
GCT GTT GCA TAA 777 

The compete S RNA of TSWV-B should be 3049 nucleotides In 
length, 134 nucleotides longer than S RNA of TSWV-CPNH1. This 
difference was mainly attributed to the elongated intergenic region of 

20 the TSWV-B S RNA. Analysis of the sequenced region of TSWV-B S RNA 
revealed two open reading frames as depicted above, which is similar 
to other TSWV isolates. The larger one was localized on the viral RNA 
strand originating at nucleotide 88 and terminating at nucleotide 1491. 
The smaller one on the vial complementary strand was defined by an 

25 initiation codon at nucleotide 2898 and a termination codon at 

nucleotide 2122. The two open reading frames were separated by an 
intergenic region of 629 nucleotides. Comparisons of the entire 
sequenced TSWV-B S RNA with S RNA regions of other isolates in the 
following table which depicts the percent homology comparison of 

3 0 aligned nucleotide and amino acid sequences of the TSWV-B S RNA with 
those of the other Isolates: 
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91.5(79.1) 
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75.8 
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B/BL 


76.3 
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60.8 
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95.1 






92.6 


97.3 


99.2(98.5) 


L3/I 


60.9 








60.9 


69.5(55.1) 


I/BL 


61.7 








60.9 


68.8(53.9) 



a The partial or complete S RNA sequences of isolates TSWV-CPNH1 (2.916 

1 5 kb), TSWV-L3 (2.837 kb), TSWV-BL (2.037 kb) and TSWV-I (1.144 kb) were 

used for comparisons with the S RNA sequence of the TSWV-B (3.049 kb). 

D Percent similarities were calculated by Comparison of their nucleotide or 
predicted amino acid sequence using the program BESTFIT of the GCG Sequence 
analysis software package. 

20 c Percent identity is in parenthesis. 

As depicted, the greatest nucleotide sequence similarity (75.8%- 
76.4%) Was shown with the L-type isolates (CHNH1, L3 and BL). To the 
lesser extent, there was nucleotide sequence similarity (63%) between 
the TSWV-B S RNA and the S RNA of TSWV-I assigned to I serogroup. 

2 5 For comparison, the sequenced S RNA regions of the L-type Isolates 

(CHPN1, L3 and BL) shared 94.8%-96.4% nucleotide sequence 
similarities. 

The open reading frame of 777 nucleotides encodes the N protein 
of 258 amino acids with a predicted molecular weight of 28700 Da. the 

3 0 sequence comparisons of the N open reading frame from TSWV isolates 

revealed that nucleotide sequences of the N genes from the Isolates 
CPNH1, L3 and BL differs from TSWV-B by a considerably larger amount 
(22%-22.5%) than they differ from each other (2.7%-3.2%). Consistent 
to the results of the immunological analysis, the N amino acid 
3 5 sequences among CPNH1, L3 and BL isolates are more closely related to 
each other (98.8%-99.6% similarities or 96.9%-98.5% identities) than to 



the TSWV-B (90.3%-91.5% similarities or 79.1%-79.9% identities).. 
Much lower homology was observed to TSWV-I at both nucleotide 
(63.1%) and amino acid (69.7% similarity or 55.3% identity) levels. 
Except for the N open readihg frame of TSWV-I that encodes 262 amino 
5 acids, the N open reading frames of the other isolates code for the 258 
amino adds. Computer analysis suggested that the extra residues of 
TSWV-I N open readittg frartte resulted from the amino acid sequence 
insertions (residues 82 through 84 and residue 116). One potential N- 
glycosylation site is found at residue 68. 
1 0 The second open reading frame of 1404 nucleotides encodes the 

nonstructural protein of 467 amino acids with a predicted molecular 
weight of 52566 Da. Comparisons with homologous open reading frames 
of TSWV-CPNH1 and TSWV-L3 showed 80% and 79% Similarities at the 
nucleotide level, and 86.1% (or 78.3% identity) and 89% (or 82.0% 

1 5 identity) similarities at the amino acid level. This open reading frame 

contains four potential glycosylation sites, which are located in the 
exactly same positions as those of TSWV-CPNH1 and TSWV-L3. 

The intergenic regioh of the TSWV-B S RNA was, due to several 
insertions, 126 and 41 nucleotide longer than the counterparts of 
20 TSWV-CPNH1 and TSWV-L3, respectively. The sequence analysis by the 
program FOLD indicated the intergenic region can form very complex and 
stable hairpin structure by internally base-pairing U-rich stretches 
with A-rich stretches of the intergenic region, which had similar 
stability to those produced from TSWV-CPNH1 and TSWV-L3 as 

2 5 indicated by minimum free energy values. This internal base-paired 

structure may act as a transcription termination signal. 

The results tabulated above also revealed that the N protein of 
TSWV-B is subject to a higher degree of selective pressure than the 52 
K protein; the similarities among the amino acid sequences of the 52 K 

3 0 protein are lower than that found for the amino acid sequence of the 

NPs. Nucleotide sequence divergence is highest among the intergenic 
regions, which indicates that this region is subject to less selective 
pressure than either genetic region. 

The evolutionary relationships among the TSWV-B and other four 
3 5 TSWV isolates were analyzed and depicted in figure 4 in which the 
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evolutionary tree organization is consistent with the relatedness of 
serological data collected for these TSWV isolates. Thus, the TSWV-B, 
according to the present invention, is more closely related to the L- 
type Isolates than to the l-type isolate TSWV-I, but Is much less 
5 similar to the L-type iSdlat&S than the L-type isolates are to each 
other. 

Despite a slight delay of symptom expression, transgenic plants 
did not show resistance to the Brazil isolate of TSWV; Serological 
results show that this isolate is distinct from the "L" and "I" type 
1 0 Isolates, and biologically different from the curcurbit isolate. The 
Brazil isolate may thus belong to still another serogroup of TSWV. In 
any event, infectivity results show that it is unlikely that a single NP 
gene will provide resistance to all isolates in the Tospovirus genus. 
Transgenic plants according to the present invention that gave 
1 5 low or undetectable ELlSA reactions (0-0.05 OD405nm) were resistant 
to infection by the heterologous isolates (Arkansas and 10W pakchoy) of 
the "L* serogroup, whereas no protection against these isolates was 
found in plants accumulating high levels of the NP. Compared to the 
ELlSA readings of control NP(-) plants (0.05 OD4rj5nm). these 
20 transgenic plants may produce little, if any, TSWV-BL NP. Similar 
results have been observed in transgenic plants, in Which the CP 
accumulation Was not detected; these were highly resistant to virus 
infection. The mechanism underlying this phenomenon is presently 
unknown. It is likely that this type of resistance might be attributed to 
25 interference of CP UNA molecules produced in transgenic plants with 
viral replication, presumably by hybridizing to minus-sense replicating 
RNA of the attacking virus, binding to essential host factors (e.g., 
replicase) or interfering with virion assembly. 

It should be noted, however, that the resistance to the 
3 0 homologous TSWV-BL isolate is apparently independent of the 

expression levels of the NP gene. Although the relative NP levels of the 
individual Ri plants Inoculated with TSWV-BL were not measured, It is 
reasonable to assume that the NP produced In these inoculated Ri 
plants (a total of 145 plants tested) ranged from undetectable to high. 
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In contrast to the case for protection against the heterologous 
isolates of the V s§fogroup, protection against the Begonia isolate of 
the TSWV-I serogroup Was found in the high NP-expressing Ri .plants. 
Comparison of NP nucleotide sequence of the "L" serogroup With that of 
5 the T serogroup revealed 62% and 67% identity at the nucleotide and 
amino acid levels, ffi$p§ctivaly. The difference of MP genes of the two 
serogroups might be So gfSat that the NP (the V serogroup) produced in 
transgenic plants acted as a dysfunctional protein on the attacking 
Begonia isolate of the "I* serogroup. Incorporation of this "defective" 
1 0 coat protein into virions may generated defective' vjrus which inhibit 
virus movement or Its further replication. This type of interaction is 
expected to require high levels of the NP for the protection. 
Alternatively, resistance to the Begonia isolate may also involve 
interference of NP transcripts produced in Ri plants with viral 

1 5 replication. If this is true, more NP transcripts {due to the 

heterologous nature of two NP gene) may be required to inhibit 
replication of heterologous virus. 

Although there are no obvious explanations for the results 
showing the relation of NP levels in individual Ri plants to resistance 
20 to the heterologous isolates of the 1" and "I" serogroups, it is believed 
these are definite trends since the data were derived from a large 
number (190) of plants. Thus, it is believed that a measurement of CP 
or NP levels in individual plants may provide a more accurate way to 
relate NP or CP levels to resistance. By this form of data analysis, the 

2 5 results show that the resistance was more closely related to NP levels 

in each test plant than to the NP level of the Rq line from which they 
were derived. For TSWV-BL Np gene in tobacco, at least, it appears that 
integration sites of the NP gene in plant chromosome may not be 
important for viral resistance. 

3 0 Studies have also been conducted to determine the reaction of 

transgenic Ri and R2 tomatoes containing the nucleocapsid protein gene 
of TSWV-BL according to the present invention to the following 
isolates: Brazil (a distantly related virus), T91 (a closely related virus) 
and BL (a homologous isolate). In these studies, transgenic tomatoes (L. 
3 5 esculentum) were produced by A tumefaciens-mediaied gene transfer 



of the nucleocapsid protein (N) gene of the lettuce isolate of tomato 
spotted wilt virus BL into germinated cotyledons using modifications of 
published procedures (see Plant Cell Reports 5:81 (1986)]. The tomato 
line "Geneva 80" was selected for transformation because it contains 
5 the Trri-22 gene Which imparts resistance to TMV/thus creating the 
possibility of producing a multiple virus-resistant line. 

Transformants were selected on kanamycin media and rooted 
transgenic tomatoes were potted and transferred into the greenhouse. 
Rl and R2 tomato seedlings expressed the MPT II gene, suggesting 

1 0 multiple insertions of this gene in the plant genome* In contrast, only 
18% of the seedlings produced detectable levels of ihe N protein. 

Nine R1 and three R2 lines were tested for resistance to the 
following three Tospovirus described, specifically TSWV-BL, TSWV- 
T91, and TSWV-B. Infectivity was based upon visual inspection of test 

1 5 plants. In those cases Where plants appeared healthy except for a few 
rust-colored rings or insect damage, extracts from these plants were 
inoculated to N. benthamiana to test for the presence of the virus. As 
depicted in the following table, nearly all control tomato plants 
exhibited typical symptoms consisting of plant stunting, leaf yellow 

20 mosaic and rugosity 3 to 4 weeks after inoculations With TSWV-BL, 
TSWV-T91 or TSWV-B. However, only 4% of the R1 and R2 transgenic 
plants became infected with TSWV-BL, 7% with TSWV-T91, and 45% 
with TSWV-B. 
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Viral resistance In transgenic R1 and R2 tomatoes expressing the nucleoprotein gene 
of the lettuce strain of tomato spotted wilt virus 

Inoculating Isolates 3 
Plant Line TSWV-BL TSWV-T91 TSWV-B 

R1 Plants: 



T13-1 


0/22 


1/26 


7/24 


T13-2 


6/20 


MTb 


Mr 


T13-3 


2/42 


0/20 


12/18 


T13-4 


0/25 


NT 


NT 


T13-9 


0/20 


NT 


NT 


T13-10 


1/50 


2/26 


11/26 


T13-11 


0/22 


NT 


isrr 


T13-12 


1/29 


NT 


NT 


T13-13 


0/22 


NT 


NT 


TOTAL 


10/252 


3/72 


30/68 


R2 Plants: 








T13-1-7 


0/8 


2/8 


5/8 


T13-1-9 


0/8 


1/8 


2/8 


T1 3-1-1 1 


0/8 


1/9 


5/9 


TOTAL 


0/24 


4/25 


12/25 


CONTROLS 


92/95 


51/53 


52/53 



a plants were inoculated at the one- to two-leaf stage with 5-, 10-, or 20- 
fold diluted leaf extract of U. benthamiana, H423 tobacco or tomato; the same plants 
were re-Inoculated 7 days later and symptoms were recorded after another 14 days; 
the reaction Is expressed as number of plants with symptoms/number of plants 
tested 

D not tested 

Accordingly, the description above supports the finding that 
transgenic tomato plants that express the N gene of TSWV-BL show 
resistance to infection to TSWV-BL, to other TSWV isolates that are 
closely related to TSWV-BL, and to the more distantly related TSWV-B. 

In further limited studies with an additional isolate, all 
transgenic plants were resistant to the. 10W (pakchoy) isolate, whereas 
the controls were infected. These results show that transgenic 
tomatoes are better protected against closely related isolates, than 
distantly related isolates. Unlike in transgenic tobacco and N. 
benthamiana expressing the TSWV-BL N gene, the level of N protein 
expression did not correlate with the observed protection in transgenic 
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tomatoes; 55% 6f th§ transgenic tomatoes were also resistant to a 
distantly related isolate of TSWV-B, Which Was hot observed In 
transgenic tobadcd and M b&hthmim plants. These discrepancies may 

fefieet that tomato is inherently lass susceptible to Tospoviruses. 
5 In addition, studies were also conducted to determine virus 

distribute in a small number of plants at 5 and i weeks after 
Inoculation, the distal halves from leaflets of all expanded leaves of 
eaeH plant were ground and back-Inoculated onto N. benthamiana. The 
results taken seven days after inoculation Showed that virus cannot be 

1 0 recovered from any leaf tissue of asymptomatic transgenic plants 
inoculated with either TSWV-BL -T91, or -B, confirming the visual 
findings reported above. In transgenic plants showing symptoms, the 
virus Is not distributed throughout the plant. For example, a transgenic 
plant which could not be conclusively rated visually contained the virus 

1 5 in only two of the 8 leaves; the second leaves from the bottom and top 
of the plant. Conversely, virus present in all leaves of the infected 
control plant, and is absent In those of the healthy control plants. 

Graft inoculations were attempted to test Whether the resistant 
transgenic plants could become infected if virus is introduced into the 

20 vascular system. Ri and Ft2 plants that had been Inoculated at 1:5, 1:10 
or 1:2t> dilutions of TSWV-BL, -T91, or -B were grafted onto control 
plants infected With the same isolates and dilutions. The 34 transgenic 
plants Were asymptomatic after 31 days, although the non-transgenic 
controls were infected. After 23 days, the top 46 cm of transgenic 

25 plants had been trimmed away to induce new growth and more plant 
stress. Although the young, vigorously growing new shoots failed to 
Show any symptoms on the 31 st day post inoculation, 33%, 31% and 45% 
Of TSWV-BL, -T91 and -B Were showing leaf or stem symptoms, 
respectively at 45 days post Inoculation. These results indicate that 

3 0 some transgenic plants are tolerant, and others are immune to 
infection. 

Thus, according to one aspect of the present invention, transgenic 
plants expressing the HP gene of the TSWV-BL isolate are highly" 
resistant to Infections of both the homologous TSWV-BL Isolate and 
3 5 heterologous isolates of the same serogroup (Arkahsas and 10W 
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pakchoy). Mere Slghificahtly, the resistance is effective to Begonia 
isolate from other §grog"reupS. In brief, the above clearly describes 
that transgenic tbbaccd plahts expressing: the riucleoprotein gene of 
t§WV*BL display feslsl&ftte to both TSWV Hhd INSV; and the protection 
5 appears to be mediated by the rtUclidprdtein against distantly related 
IN§V ahd by the nUctiepfdteiri gene ribonucleotide sequence against the 
hombl6gOU§ fthd dl6§ely related TSWV Isolates, this is the first time 
broad spectrum resistance of the engineered plants to different 
isolates of TSWV ha§ been shown. 
1 0 While coat protein protection generally displays delay and/or 

reduction in irtfectioh and symptom expression, but ho immunity, the 
present invention provided a significantly high percentage of transgenic 
plants which were symptom-free and tree of the infective virus. 
Resistance of these plants under greenhouse conditions persisted 

1 5 throughout their life cycle* and more Importantly Was inherited to their 

progenies as shown above. 

It was observed in the present invention that the transgenic 
plants producing little, It any, TWSV-BL NP Were highly resistant to 
infection by the homologous isolate and other closely-related isolates 
20 within the Same SOrdgrOUp of TSWV, whereas no protection Was found 
in those expressing high levels of the NP gene. 

The biological diversity of TSWV is well documented and has 
been reported to overcome the genetic resistance in cultivated plants 
Such as tomato. Thus, it is extremely important to develop transgenic 

2 5 plants that shew resistant to many strains of TSWV. The present 

invention indicates that one method to do so would be to utilize the 
viral NP gene to confer this resistance, and that this resistance would 
be to diverse TSWV Isolates. Thus, the finding of the present Invention 
that the expression of TSWV NP gene is capable of conferring high 

3 0 levels of resistance to Various TSWV isolates has a great deal of 

commercial importance. 

In another series of Studies, Plasmid BIN19-N+ Was constructed 
and transferred to A. tUfiiefectehs strain LBA4404 in accordance with 
Example IV, and transferred to Nicotiana benthariiiana in accordance 
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With Sample V. The huoleocapsid genes of INSV-Beg and -LI were 
amplified with oligomer primers INSV-A 

(S'^f AdTtAf OtAGAAdOAf GQACAAAGCAAAGATTAGCAAGG) and INSV-B 
(^fA§A(3t^GATGdAtdGmmaAMTAAttTAtAAAAGCAC), 
5 hybridizing; td the S^eadlnJ ahd S'-hohcoding: regidHS of the 

nUfcl66bapsid g-§fle 6t aft IKISV isolate, respectively. The amplified 
rtudeoeapsid g§rie fragments were purified in accordance with Example 
III, find digested and sequenced in accordance with Example iv. 

<3t a total Of 24 N-J- (transformed With pBlNl9-N+) and 18 N-' 

10 (transformed with vector pBINIS) transgenic N. b'erijhamiana plants 

were transferred to Sol! and grown in the greenhouse. All N+ lines were 
cOttfifffied by PbH at leaf stages 4-5 to contain the N gene sequence. 
The relative level of N protein accumulation was estimated in each 
independent Rn transgenic clonal line by DAS-ELISA using antibodies of 

1 5 the f §WV-BL N protein. Of the twenty-four N+ lines, two had OD405nm 
readihgS of 0.50-1.00* sevehteen between 0.02-0.10, and the remaining 
five less than 0*02. Healthy N. benthamiana or transgenic N" plants 
gave OD405nm readings of 0.00-0/02. All the Ro plants were self- 
polllhated arid the seeds from the following transgenic lines were 

20 germinated on kanamyoirt (300 iig/ml) selection medium for inoculation 
tests: (1) -2 and -6, control transgenic lines containing vector 
PBIN19 alone; (2) N+-28, a transgenic tine that produced an 
undetectable amount of the N protein (OD405nm = 0.005); (3) N+-21, a 
transgenic line producing a low level of the N protein (OD405nm = 

25 0.085); and (4) N+-34 and -37, two transgenic lines accumulating high 
levels of the N protein (OD405nm = 0.50-1.00. These six lines were 
then analyzed by Northern hybridization; the intensity of N gene 
transcripts correlated well with the levels of ELISA reactions. 

Transgenic seedlings from the six Rn lines were selected by 

3 0 germinating seeds on kanamycin selection medium, and these seedlings 
were inoculated with the five Tospoviruses. The Inoculated R1 plants 
were rated susceptible if Virus Symptoms Were observed on 
uhihoculated leaves. IH order to exclude the possibilities of escapes, 
transgenic control N- plants were always used in each inoculation of 

3 5 transgenic N + plants. In addition, each inoculum extract was always 
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used to first inoculate N+ plants followed by control N" plants. The 
results from this series bf studies ate depicted below: 

Reactions of &1 plants expressing th§ tiucl&dc§p§ld (N) ftotein gene of N. 

bMUanmi sotted wilt vifu§ (t§WV) to itteeuiatidft with Tospavinsses 

5 No. plants Infected/No, plants inoculated b 



tSWV ISQkAtE >Msv.i§QLAtt= 



ftoLlrie . 


ELlSAa 


6L 


10W 




U 


TSWV-B 




*0.02 


32732 


32732 


32/32 


20/20 


32/32 




0.005 


16716 


16/16 


15/16 




16'/16 


1 0 N+-21 


0.085 


9/40 


17/40 


39/40 


18/20 


40/40 


, N+-34 


0.715 


25/28° 


28/28 


23/28° 


V 


28/28 


N+-3? 


0.510 


26728° 


22/22 


21/28° 


16/20° 


22/22 



a EUSA date of Ro lines from Which the Ri plants Were derived; 

b 304dtd diluted leaf extracts df Infected N. benthamiana plants Were applied to the 
1 5 three leaves of plants at the 3-5 leaf stages. Each extract was always used to 

Inoculate N+ plants folldWed by control N- plants. Data were taken daily for at least 
two mdfiths after Inoculation and expressed as number of plants systemically 
infeeted/number of plants maculated; 

c indlcate that nearly all susceptible R1 plants displayed a significant delay of 
20 syrfiptom appearance. 

As depicted In the above table, all R1 plants from control lines 
M--2 and -6 showed systemic symptoms 5-8 days after inoculation 
with fill the viruses tested. None of the Ri plants from line N+-28 
produced detectable levels of the N protein, and all were susceptible to 

25 these Viruses except tor ohe plant Inoculated with INSV-Beg. ELISA 
assays of leaf discs from this N+-28 Ri plant sampled before 
inoculation Clearly showed that the plant identified to possess the 
INSV-Beg resistant phehotype did accumulate a high level of the N 
protein (OD405nm = 0.78 as compared to OD4o5nm <0.02 for all other 

3 0 N+-28 Ri plants). 

The low N gene expressing line N+-21 showed the best resistance 
against the homologous (78%) and closely related TSWV-10W (57%) 
isolates and very little resistance to the two INSV isolates (3% and 
10%); enly three N+-21 plants showed the resistant phenotype when 

3 5 Inoculated With the INSV Isolates. Leaf samples from these INSV- 

resistant N+-21 Ri plants gave much higher ELISA reactions (OD405nm 
0.5 to 1.00) and thus higher amounts of the N protein than the 
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Susceptible N+^21 plants (OD405nrti 0.02 to 0.20). The high N gene 
e*pfe§Sirtg lines N+*34 and -37 Showed the highest resistance to INSV 
isolates (18%-2S%) fallowed by the homologous TSWV-BL isolate (7% 
and Wo) while nbhd of the plants showed resistance to tswv-10W; 
5 however, the N+-34 and -97 bi plahts that became infected with insv 
or f§Wv-BL did shew various lengths of delays IH symptom expression. 
Ndfie el the f=M plants ffOffi these tbur transgenic N+ lines were 
temm td tSWV-B; Softie of the R1 plants from the N+-34 and -37 
llne§ Shewed a slight delay of symptom appearance 

10 Ifl Studies to determine Whether the level of protein production 

lh N+ fti plants was associated with resistance to different 
fd§0dWruS#Sj the inoculated N+ R1 plants in the preceding table were 
fe-bfganizSd into four groups based on the intensity of their ELISA 
reactions of tlSsu§S taken before inoculation irrespective of original 

15 Rrj P&lts. the N + Hi plants that expressed low levels of the N protein 
(0.0^0,2 OD) Showed high resistance (100% and 80%) to TSWV-BL and 
"10W but Ware all susceptible to INSV-Beg and -LI, showing no 
detectable delay lh Symptom expression relative to control N~ plants. 
Ih adfflfast, hearly all N+ 81 plants With high levels of the N protein 

20 (0.26*1,00 OD) showed various levels of protection against TSWV-BL, 
INSV^Beg and -LI, f&hgihd. from a Short delay of symptom expression to 
complete resistance With most of these plants showing various lengths 
of delay ih symptom development relative to control N" plants. No 
protection Was observed Ih the high expressors against TSWV-1 OW. In 

2 5 additloh, none Of the N + &1 plants Were resistant to TSWV-B regardless 

of the level of N gene expression; however, a short delayed symptom 
appearance was observed in the N + Ri plants producing high levels of 
the N protein. All cohtrol N- Ri plants and transgenic N+ Ri plants with 
undetectable ELISA reactions (0 to 0.02 OD) were susceptible to all the 

3 0 Tospdviruses tested. 

The Inhibition Of replication of a distantly related INSV in N. 
benthamiana protoplasts expressing the TSWV-BL hucleocapsid; gene 
Was also studied. In these Studies, whole INSV-LI virions were used to 
infect protoplasts that were isolated from three transgenic lines to 
3 5 investigate how the products of the transgene affect replication of the 
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tncorhitig virus. Viral replication was determined by measuring the 
level Of the N protein of the infecting; INSV In transgenic protoplasts 
Uslild. antibodies specltib to the INSV N protein. DAS-ELISA analysis 
Showed that all progenies ffOfti a given line were relatively uniform and 
5 nearly all Ri progeny gave ah expression level of transgenic N gene 
Similar to their parental transgenic line, these results allowed for the 
prediction of the expression level of Ri populations based on that of 
their parental lines. Protoplasts derived from Ri plants of the low 

Gxpresser line N+-21 Supported the replication of INSV-LI whereas' 

1 0 protoplasts from Ri plants of the higher expressor line N+-37 did not 
until 42 hours after inoculation at Which low levels of viral replication 
Were Observed. The Same protoplasts at various time intervals (e.g. 0, 
19, 30 and 42 hours) were also assayed by DAS-ELISA using antibodies 
Specific to the TSWV-BL N protein to monitor the expression level of 

1 5 the transgene. As expected, protoplast from N+-21 Ri plants produced 
relatively low levels (0;338-0.395 OD405nm) whereas protoplasts from 
N+-37 Ri plants accumulated high levels (0.822-0.865 OD405nm). The 
expression level was found to be consistent at all time points. 

Irl this aspect of the present invention it has been shown that 

20 transgenic A/. b§nMmm& plants that accumulate low amounts of the 
TSWV-BL N protein are highly resistant to the homologous and closely 
related (TSWV-IOW) isolates, While plants that accumulate high 
amounts of this protein posses moderate levels of protection against 
both the homologous and distantly related (INSV-Beg and INISV-LI) 

25 Viruses. More Importantly, these findings indicate that transgenic N. 
benthamiana plants (a systemic host of INSV) are protected against 
INSV-Beg and INSV-LI isolates. 

As discussed above, We have shown that transgenic plants 
expressing the N gene of TSWV are resistant to homologous isolates, 

3 0 and that such plants expressing the TSWV-BL N gene are resistant to 
both TSWV and INSV. It has also been sTioWn the best resistance to 
homologous and closely related Isolates was found In transgenic plants 
accumulating low levels Of N protein while transgenic plants With high 
levels Of TSWV-BL N protein were more resistant to serologically 

3 5 distant INSV isolates. This observation led us to suspect the role of 



the translated N prbtein product in the observed protection against 
homologous ahd closely related isolates and to speculate that either the 
N prte itself Which wfis Inserted Irito th@ plant genome or its transcript 
was involved lh th§ proteciibn. to test this hypothesis transgenic 
5 plants containing the 0f6m6teri§ss N gene or expressing the sense or 
antisense untranslatable N ceding sequence Were produced. What was 
discovered Was that both sense and antisehse untranslatable H gene 
RNAs provided protection against homologous and closely related , 
isolates, and that these RNA-mediated protections were most effective 

10 in plants that synthesi2Gd low levels of the respective RNA species and 
appears to be achieved through the inhibition of viral replication. 

More specifically, the coding sequences introduced into 
transgenic plants is shown in figure 7. As depicted, the construct 
pBIN19-N contains the promoterless N gene inserted ihto the plant 

1 5 transformation vector PBIN19 (see Example IV). All other constructs 
contain a double 35S promoter of CaMV, a 5-untranslated leader 
sequence of alfalfa mosaic virus and a 3'-untranslated/pblyadenylation 
sequence of the nopaline synthase gene. pBI525 is a plant expression 
vecdr and is used irt this study as a control; pBI525-mN contains the 

20 mutant (untranslatable) fdi-hl of the N gene; pBl525-asN contains the 
antisense form of the untranslatable N gene. One nucleotide deletion at 
the SMermirius of the mutant N gene is indicated by the dash Symbol. 
ATG codons are underlined and inframe termination codons in the 
mutant gene are shown in bold. 

25 EXAMPLE Vlll 

Primer-directed mutagenesis and cloning of the TSWV-BL N gene 
was conducted as follows: 

Full-length N gen§ Was obtained by reverse transcription and 
polymerase chain reaction as described in Phytopathology 82:1223 

3 0 (1992), the disclosure df Which is incorporated In toto herein. The 
untranslatable N coding sequence was similarly generated by RT-PCR 
using oligomer primers A 

(AQOATTQGAtCCAtGQTtAACACACTAAGCAAGCAC), Which is identical to 
the S RNA In the 3'-rtoncoding region of the TSWV-BL N gerte, and B 
3 5 (AGCTAATCTAGAACCAtGGATGACTCACTAAGGAAAGCATTGTTGC), 
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complementary to the S ttNA IH th6 5*-terminus of the N gene. The 
latter Oligomer primer contains a frameshift mutation immediately 
after the translation Ihltlatldh fcodori and several termination codons to 
- bldfck possible translation feadtHrOughs. the intact and mutant N gene 
5 fragments were purified m a agarose gel as described in Example 
II. the gel-isbtated Intact arid mutant N gene fragments were digested 
With the appropriate testation ertzyme(s) arid directly cloned Into 
SamHI/Kbai-digested plant transformation vector pBINi9 and Ncol- 
dlgested plant expression vector pBI525, respectively as described in 

1 0 Example IV. the resulting blasmids were identified' and designated as 
pBIN194l containing the IhtacL promotertess N gene, and pBI525-mN 
end pBI525-asN containing; the mutant coding sequence in the sense and 
antisense orientations, respectively, relative to cauliflower mosaic 
virus 3SS promoter, the translatability of the mutant N coding 

1 5 sequence in the expression cassette was checked by transient 

expression assay in Nicdiiatia tabacum protoplasts; and the expression 
cassettes containing the sense or antisense mutant N coding sequence 
were then excised from plasmld pBI525 by a partial digestion with 
Hindlli/ECORI (since the N coding sequence contains internal Hindlll and 

20 ExoRI Sites), end ligeted Into the plant transformation vector pBIN19 
that had been cut with the same enzymes. The resulting vectors as well 
as pB!Ni9-N were transferred to A. tumefaciens strain LBA4404 using 
the procedure described in Example IV. Leaf discs of N. tabacum var 
Havana cv 423 Were Inoculated with the A. tumefaciens strain LBA4404 

25 containing various constructs and the resulting transgenic plants were 
self-pollinated and Seeds were selectively germinated on kanamycin 
medium. 

PCR was performed oh each Ro transgenic line as described 
above, the oligomer primers A and B were used to determine the 

3 0 presence of the N coding; Sequence of TSWV-BL The oligomer primer 

35S-pr0moter (CCCACtAtCCTTCGCAAGACCC) Was combined with either 
the Oligomer primer A or B to confirm the orientation (relative to the 
OaMV promoter) of the mutant r\l coding sequence inserted into the 
plant genome. DAS-ELISA used to detect the N protein in transgenic 

3 5 plants Was performed using polyclonal antibodies against the TSWV-BL 
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N prdtein. F6I- m estimation of UNA transcript level In transgenic 
plants by Northern blot, total plaht ftfoAs Were isolated according to 
Napdli {see the Plant §eil ££79 (1990)], find Were separated on a 
foffr1§ia§hyd^66rttfilftffH9 a§&fd§8 del (10 itgTiahej. the agarose gels 

s ware than stained with mHldtum bromide to ensure unifarmaiity of 
total piant fiNAs In each lane; Hybridization conditions ware as 
described In the GefteScteen Plus protocol by the manufacturer. 
Resulting slgrtal bldts were compared and normalized based on the»N 
geftg transcrtpt bahd of tha control lane (tha mN Pi plant producing a 

1 0 high level of the N cjene transcript) Included in each blot. The 

transgenic plants that gave density readings (Hewlet ScanJet and Image 
Analysis Program) between 100 and 150 Were rated as high expressors, 
while the plants With densities between 15 and 50 Were rated as low 

expressors. 

1 5 Inoculation of transgenic plants with Tospovirus was carried out 

as described above With Inoculation being done at the 3-4 leaf stage 
except were indicated. 

Tobacco protoplasts were prepared from surface-sterilized 
leaves derived from FH plants [see Z. Pflanzanphysiol. 78:453 (1992) 

20 With modifications!, the isolated protoplasts (6 x 10^ protoplasts) 
were transformed With 0.68 OD260nm of the purified TSWV-BL virion 
preparation using the PEG method [see Plant Mol. Biol. 8:363 (1987)]. 
The transformed protoplasts were then cultured at the final density of 
1 x 10 s protoplasts /ml in the culture medium at 26°C in the dark. 

25 After various intervals of Incubation, the cultured protoplasts were 
washed twice with W5 solution and lysed by osmotic shock in the 
enzyme conjugate buffer. Viral multiplication (replication) was 
estimated by measuring the N protein of the virus Using DAS-ELISA. 
As described, one aspect of the present invention demonstrated 

3 0 that transgenic tobacco producing none or barely detectable amounts of 
the N protein were resistant to homologous and closely related isolates, 
this result suggested that the observed resistance may have been due 
to tfans interactions of the incoming viral N gene ftNA with either the N 
gene transcript produced In the transgenic plants or the N coding 

3 5 sequence Itself. To test Whether the presence of the nuclear N gene 
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plays a role, transgenic P*H Rrj lines and Ri plants from two P°N lines 
were challenged with fauf tdspovituses (TSWV-BL, TSWV-loW, INSV- 
Beg and TSVW-B). Only asyrnptefilatic plants were fated resistant 

while plants shawing any gyffibtaffls were rated susceptible. An 

5 ihdcUlflted Rrj and &t plants Were Susceptible to th§ Viruses. 

To further test M fusibility that the trahScrlpt of the N 
transgene Is involved In the protection, a number af R 0 transgenic 
plants that produced either the sense or the antisense N gene transcript 
but ftdt the N protein were inoculated with the homologous isolate. 
1 0 Results appear in the fallowing table: 

Pom af Level of N Na. of RO No. of lines No. of lines 
transgene* gene RNA D lines tested inoculatedc resistant 



mN 


H 


8 


4 


0 




L 


17 


16 


16 




nd 


4 


1 


0 


asN 


H 


6 


3 


0 




L 


9 


5 


5 




nd 


1 


0 


0 




nd 


12 


6 


0 



2 0 8fnN ahd asN represent plants expressing the sense and antisense untranslatable N 
gfenesj respectively, P°H represents plants containing the promoterless N gene; 

b the level of the H gene RNA Was estimated in each line by Northern blots, nd 
indicates that the N gene transcript was not detected; 

c 30-fold diluted leaf extracts af the N. benthamlana plants infected With TSWV-BL 

2 5 were applied to three leaves af each plant at the 6-7 leaf stage. Each extract was 

first applied td all test plants tolldWed by control healthy plants. Data were taken 
daily for 45 days after inoculation and only the asymptomatic plants were rated 
resistant. 

Unlike the controls, Which developed typical systemic symptoms 

3 0 7 to 9 days after inoculation, 16 out of 21 mN plants and 5 our of 8 asN 

plants were asymptomatic throughout their life cycles. Northern blot 
analysis of leaf tissues sampled before inoculation showed that all the 
resistant fin lines produced low levels of the sense or antisense N gene 
RNA, Whereas the susceptible Rrj lines produced either none or high 
3 5 levels of the RNA species. Since this data suggested that the N 
resistance of transgenic plants to .TSWV-BL was related to their 
relative levels of N gene transcript, transgenic progenies from four mN 
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n 



and three asN fto lifted With either high or low N gene transcript levels 
were selected by 9§fhiif)atiGh on kanamycin-contaihing media. These 
trarisgenic plafite were tested tof resistance to the four Tospoviruses 
at the 3 to 4 leaf stage, except that sorne &i plants from two asN lines 
Were Inoculated at the 6 to 7 leaf stage, the results are summarized in 
the following table: 
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Ail H\ plants ffom high expressor lines mN-2 and mN-7 were 
susceptible to infections By all tospoviruses tested, and these plants 
did hot show a delay 6t symptom appearance as compared to controls, 
in contrast, high prep6rtidhs 6t the R1 plants from low expressor lines 

5 rnN-18 aftd -1fi ware fesistart to homologous (TsWV-bi_) ahd closely 
related (tSWV-lOW) Isolates, but hot resistant to infections by 
distantly related to$pdVlirU§l}& (INSV-BSg and TSWV-B). The resistance 
of asN fti plants from I6W expressor Ro lines was markedly influenced 
by tH§ fSWV isolate used tor Inoculation. All but one of the small Ri 

1 0 plants (34 leaf stage) from low expressor lines asKM and -9 became 
infected, although there Was a delay of symptom appearance, when 
inoculated With the homologous TSWV-BL or closely related TSWV-10W 
isolates. In contrast, most of the large Ri plants (6-7 leaf stage) from 
line aSN*9 were resistant to both isolates. In comparison, control Ri 

1 5 plants ahd Ri plants from the high expressor line such as asN-4 

displayed no resistance to either of the isolates regardless of the size 
of test plants. Antisense RNA-mediated protection was not effective 
agaiftSt infeotioh by the distantly related. INSV-Beg and TSWV-B 
isolates. 

20 Analyses of data presented In the above two tables suggest that 

Sense ahd antisense RNA-mediated protections are observed only in low 
expresses of the N gene; The R1 asN plants that produced high levels 
of the antisehSe N gene transcript were as susceptible as control 
plants. Ih contrast, the asN low expressors displayed a delay in 

25 Symptom appearance When ihoculated at the 3-4 leaf stage and showed 
Increased levels Of resistant When inoculated at the 6-7 leaf stage. 

Inhibition of viral replication in tobacco protoplasts expressing 
the sense or antisense form of untranslatable N coding sequence was 
also noted. In this instance, whole virion preparations of TSWV-BL 

30 were used to transfect protoplasts isolated from transgenic lines to 
investigate the effect of sense or antisense N gene transcript on 
replication of the Incoming Virus. Viral replication was determined by 
measuring the level bf the N protein of the Incoming virus in 
transfected protoplasts, and it was found that protoplasts derived from 

3 5 plants (mN-7 and asN-4) that produced high levels of the respective 
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ftNA transcripts Supported the replication of the Virus, Whereas 
pf6t6pfSStS frdm ffiN \W e5tpreSS6r (rrtN-18) did Hot. Protoplasts from 

an fisN low ejtpmsset- (a§N-9) supported much lower levels of viral 
replication. 

5 Aecbfdirtgly, In tHIs aspect Of the present iHvehtion We have 

ShoWh that transonic plaht§ expressing S§HS§ tit ahtisense form of 
UfttmftSlatable U Q6h§ 66dlH§ sequence are rdSlStaht td hdmologous 
(t§WV-§L) artd cldSely related (TSWV-10W)> but hdt to distantly . 
related (IN§V4e0 fifld TSWV-B) Tospovirutes. the following table 
i 0 provides a cerhpafiSdh ef resistance to Tospovitusds between 

transgenic tobacco expressing various forms of the TSWV-BL N gene: 

Form of the Transqenf> a 





Homology to 












TSWV-BI* N Gene b 


N 


ffiN 


asN 




TSWV-BL 


100% 


R 


R 


RC 


S 


t$WV*1 ow 


99% 


R 


R 


RC 


S 


INSV-Beg 


60% 


RC 


S 


S 


S 


TSWV-B 


78% 


S 


S 


S 


S 



Reactions of transgenic tobacco and N. benthamiana plants expressing the intact N 

2 0 gene (N) of TSWV-BL to Inoculation With the four Tospovlruses are included for 

comparisons with modulation results of transgenic plants containing untranslatable 
(mN) , anflsense (a$N)* and promoterless (P°N) N coding Sequences , R = resistant, 
s * susdeptibie; 

b the nucleotide sequences are as reported in Phytopathology 82:1223 (1992) and 
25 Phytopathology 83:728 (1993) 

c level of resistance may depehd upon the concentration of inoculum. 

these results Oonflrm and extend the earlier aspects of the 

present invention for RNA-mediated protection with TSWv. 

Furthermore, the protection is observed in plants producing low rather 

3 0 thah high levels of the N gene transcript, and although earlier studies 

reported herein indicate that tobacco plants which produced high levels 
tif the tsWV-6L N protein displayed resistance to INSV-Beg, this 
additional data indldates that since resistance to INSV-Beg was not 
observed in transgenic pl&hts expressing the sense or antisense form of 
35 the untranslatable of the N gene thus clearly indicating that protection 
against INSV-Beg IS due to the presence of the N protein and not the N 
gene transcript. Thus, it appears that two different mechanisms are 



52 



involved in protection transgenic plants against tsWV and iksv 
Td$p6\/iruse$ according td the present invention. One mechanism 
involves the N gene ttaris&flpt (ftNA-mediated), and another involves the 
N ptdtelft {pf5t8ln-rfi6didat§d) 4 In addition) the results of the protoplast 
5 §xp§flfh&m$ Indicate thai N g§n§ ftNA-riiediat&d pfdtectioh is achieved 

through a process that inhibits viral replication, and the data contained 
in the above tables suggest that protection against the distantly 
related INSV*Beg isolate is conferred by the N protein of tsvvv-bi, and 
not by the gene transcript. 

10 Finally, further Studies were conducted to provide still another 

aspect Of the present invantioh - that a portion of the Tospovirus 
nucleopfoteih gene provide protection of transgenic plants against 
infection by the T66pdvitU&. It has already been demonstrated above 
that the N gene RNA protects against homologous and closely realated 

1 5 tSWV isolates While - the N protein protects against the homologous 
Isolate and distantly related IMSV isolates; that N gene RNE-mediated 
protection is effective in plants expressing low levels of the N gene 
Whereas N protein-mediated protection requires high levels of N protein 
accumulation; and that the N gene RNA-mediated protection is achieved 

20 through inhibition of viral replication. Based upon this prior data, we 
ftext Set out to determine Whether a portion of the N gene might work 
against infection by the virus. We found, as discussed below, that 
transgenic plants expressing about one-half of the N gene sequence is 
resistant to the virus. 

25 the following describes the cloning of one-half N gene fragments 

of TSWV-BL in order to demonstrate this final aspect of the present 
invention, the first and second halves of both the translatable and 
untranslatable N gene were generqated by reverse transcription and 
then PCR as described above. As depicted in figure 8, the nucleotide 

3 0 deletion or insertions at the 5'-terminals of the Untranslatable half N 
gene fragments are Indicated by the dash symbol; AtG codons are 
underlined and all possible termination codons immediately after the 
ihitiatiOh codon of the Untranslatable half N gene fragments are N shown 
in bold. c „ 



the first half Of the N gene was produced by RT-PCFt using 
biigfipHmete I (S'-fA8Al§f6t3AtCCAtGGTtAAGQfAAtCCATAGGCTTGAC), 

wHioh Is eortiplememary te the central region ef the tsWV-bl N gene, 

and « (S*-A^6TMi6Af^Q1i'MGCtcActMGGAMGCA1tGtTGC) for the 

5 translatable or ill 

(S^A^tMtdtAGAAObAtGGATGACTCACtAAGdAAAGCAtTGTTGCJfor 
the Untranslatable flfftt half N gene fragment, the latter two oligomer 
primers are idantidal to the S'-termihus of the M gene. Similarly, ,the 
secbHd half of the N g^ftS was produced by ftf-Pcft usihg oligomer 
1 0 pflftiefS iy (6'*AGCAtt4^AtCCATGGTTAACACACtAAGCAAGCAC) which 
Is complementary td th§ 3'-honcodihg region of the TSWV-BL N gene, 
arid v (5-TAOAGTTOtAdlAACCATGGATGATGCAAAGTCTGTGAGG) for the 

translatable or vi 

(B'-AGATTCTCtAGACCAtGGTGACTTGATGAGCAAAGTCTGTGAGGCTTGC) 

1 5 for the untrahslatablesecortd half N gene fragment, the latter two 

oligomer primers are Identical to the central region of the N gene. The 
oligomer primer Hi cohtaihS a frameshift mutation immediately after 
the translation codort and several termination codohs to block possible 
translation readthroughs while the oligomer primer vi contains several 

20 inframe termination codohs immediately after the translation 
initiation codon. 

the half gene fragments were purified on a 1.2% agarose gel as 
described above, and the gel-isolated gene fragments were digested 
with the restriction enzyme Ncol and directly cloned into Nco\ 

25 -digested plant expression vector pBI525. The resluting piasmids were 
identified and designated as (1) pBI525-1N containing the first half 
translatable N gene, (2) pBI525-1N' containing the first half 
untranslatable N gene, (3) pBI525-1N- containing the first half 
translatable N gene in the antisense orientation, (4) pBI525-2N 

3 0 containing the second half translatable N gene, (5) pBI525-2n' 

containing the second half untranslatable N gene, and (6) pBI525-2N- 
corttalfllng the second half translatable N gene in the antisense 
orientation, the expressln cassettes were then excised from plasmid 
pBI52S by digestion with W/ndll/EcoRI and ligated as described above 

3 5 into the plant transformation vector pBINl9 that had been cut with the 



54 



same enzymes. The resulting vectors as well as plasmid pBlKl19 were 
transferred to A tumtamhS strain LBA4404, using the procedure 
described by Holsters SU0ra. Leaf discs of N. beritharnlana were 
modulated with A. tummmtas strain LBA4404 eemairting the various 
5 constructs, transgefita plants were self-pollinated -arid seeds were 
selectively germTriated 6rl kafiamycin as described above. 

Aftalyeis of trUHSgerile plants by pcr and Ndrthern hybridization 
PcR wis perfdfmed art eaah R 0 transgenic line as described previously. 
The dllgemer primers I ta vi were used to determine the presence of the 

1 0 N c6difig sequence 6f TSWv-BL. The oligomer primer 35S-Promoter (see 
Example VIII) Was 66fnblned With one of the above oligomer primers to 
confirm the orientation (relative to the CaMV 35S promoter) of the half 
gene Sequences inserted Ifflo the plant genome. Northern analysis was 

conducted as described in Example vm. 

1 5 Lettuce isolate of f SWV (TSWV-BL) was used to challenge 

transgenic plants. Inoculation was done using test plants at the 3-4 
leaf stage as described above. To avoid the possibility of escapes, 
dontfb! pants were Used Irt each experiment and each inoculum extract 
was used to first inoculate the transgenic plants followed by control 

20 plants. 

The various constructs used in this aspect of the present 
invention are illustrated in figure 8. Translatable and untranslatable 
half N gene frademntS were synthesized by RT-PCR and then cloned 
directly into the plant expression vector pBI525. The oligomer primers 

25 lii and vi, used for generation of untranslatable half N gene fragments 
by RT-PCR, contains a mutation immediately after the translation 
initiation codort ahd the resulting reading frame contains several 
termination codons to block possible translation readthroughs. Thus, 
both first and Second halt untranslatable N gene fragments should be 

3 0 Incapable of prodcing the truncated N protein fragments when 

irttrddueed into plants. Both translatable* and untranslatable half N gene 
fragements were then placed downstream of the caMV 35S promoter of 
the vector pBlS25 irt the Sense orientation or in the antisense ^ ^ 
orientation. The expressln of the half H coding sequences of TSWV-BL 

3 5 was thus controlled by a double CaMV 35S promoter fused to the 5'- 
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untranslated leader sequence of alfalfa mosaic virus (ALMV) of the 
expression vector pBl&2& Impression vectors that Utilize the Stacked 
double daMV 35S ptfirfldtef elements are known to yield higher levels of 
mftNA trnscription than similar vectors with a single 35S promoter 
5 element. Expression cassettes wefe transferred f>6rri the Vector 
PBISSS to the pant MftslafrMtion vector pB!Nl9. the resluting 
plasmlds as well as the OOfflroi plaimid PBIN19 wefe then transferred 
into A. tumBtedm strain LBA4404. Transgenic plants were obtained 
with nomenclature of the transgenic lines Shown in figure 6. 
10 All the kanamycin^resistant transgenic lines Were confirmed by 

PCR to contain the proper N Coding sequences In the expected 
orientations. Each tfansgeinc Ro line which Was grown for seeds was 

then assayed using Northern blot. Six out of six 1N, four out of six 1N\ 
six out of six 1N-, six Out Of Six 2N, seven out of eight 2N\ and six out 
15 of seven 2N- transgenic Rrj lines were found to produce halt N gene 
RNAs. 

A set of transgenic Rrj plants Was challenged With the 
homologous Isolate TSWV-BL Only asymptomatic plants were rated 
resistant while the plants' showing any symptom (local lesions or 

20 systemic infections) were fated susceptible. All the inoculated Rn 
control plants were Susceptible to the virus; in contrast, two out of 
nine 1N\ two out Of six IN-, four out of ten 2N', and ohe out of eight 2N" 
Rrj lines were found to be completely resistant to the virus infection. 
Although none of the 1N and 2N Rrj lines showed high levels of 

25 resistance, some Of those plants displayed significant delays of 

symptom appearence. 

Ahother Set of transgenic Rrj lines was brought to maturity for 
seed production. Seedlings were germinated on kanamycin-containing 
medium and inoculated With TSWV-BL As shown in the following table, 
3 0 control seedlings and seedlings from some of the transgenic lines were 
susceptible to the Isolate Whereas seedlings from lines ifsl-151, IN'- 
123, and 2N'-134 Showed Varlojs levels of protection ; ranging from 
delays ih symptom expression to compete resistance. 
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10 





No.blants Infected/Mo, blants inoculated 


RO Htie 


6DPI 


15DPI 


30DPI 


Cdrttrdl 


50/50 






1NM49 


17/1? 






1N-151 


2/20 


13/20 


17/20 


1NH23 


16/20 


17/20 


17/20 




20/20 






1NM&6 


15/19 






1N^130 


12/16 


15/15 






18/19 


19/19 




2N-155 


20/20 






2NM34 


0/20 


10/20 


10/20 


2NM35 


19/19 






2NM42 


20/20 






2N--143 


20/20 
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In the above table, 30-fold diluted extracts of Infected N. benthamiana were 
used to inoculate transgenic plants at the 3-4 leaf stage followed by control 
transgenic plants. DPI a days post inoculation. 

In summary, this aspect of the present Invention shows that 

20 transgenic plants expressing the first or the second half of either 

translatable or untranslatable N gene fragment are highly resistant to 

the hemologous TSWV-8L isolate. This result demonstrates that a 

portirt of the N gene is Sufficient for resistance to the virus. 

A listing of all nucleotide and amino acid sequences described in 

25 the fofegoihg descriptioh of the present invention is as follows: 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(I) APPLICANT: behnis Gonsalves and Sheng-Zhi Pang 
(ii) TITLE OF INVENTION: Tomato Spotted Wilt Virus 
3 0 (iii) NUMBER OF SEQUENCES: 30 

(2) INFORMATION FOR SEQ ID NO:1 : 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base' pairs 

(B) TYPE: nucleic acid 
3 5 (C)S1t4ANDEDNESS: single 

(D)TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 
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agcaggcaaa ACTCGCAGAA. CTTQC 25 
(2) INFORMATION FGR SlG ID N02: 

(i) SEQUENeg CHARACTERISTICS: 

(A)Lj=NGtH: ' 25 base pairs 
5 (BJTYPfe nucleic add 

(C) StftANtiEbNESS: single 
(b)t8f*6Le§¥:, linear 

(ii) MoL£cUL&T¥l%: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N02: 
* 0 tXaAG&TCaS CGAGTnTGC CTGCT 25 

® INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
1 5 (C) STRANDEDNESS: single 

p) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(xi) sequence description: seq id no:3: 

agctaAccM 1 gghaagctc actaaggaaa gc 32 

2 0 (2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

2 5 (D) TOPOLOGY: linear 

(H) Molecule type: dna 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

AGCATTCCAT GGTIAACACA CTMGCAAQC AC 32 
(2) INFORMATION FdR SEQ ID NO:5: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear * 

3 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N05: 

cmgitgaaa. gcaacaacag aactgtaaat tctcttccag TGAAATCTCT " 50 

GCTCATGTCA GCAGAAAACA ACATCATQCC TAACTCTCAA GCTTQCACTG 100 
AI*CTCATTT CAAGCTGAGC CTCTGGCTAA GGGTTCCAAA GGTTITGAAG 150 
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(AGMXXA I^GAAaIT G^TCAAGGIT GCAGGAGATG AAACAAACAA 200 

MAWem? TTfiftaxMte Oclfecarrcc aaaccataac agtgttgaga 250 

(IfflRlA C&Mfffl A^ftQCAAGC A3X3kGCTCCC AtafrxSCAAA 300 

MAMStg cffi&AMft Meaatgatg TmxnteMT t&aaggrgcc 350 

5 ttMkftSAM G£fcftft&C£ (M£iA3&G0C CAAAGGMfcG (StOCAATQC 400 

Mi&SjdGk Mc&Ag&a I^JMgcAgA Ab'iiuiite Mctracttg 450 

o&li&fcaiG *iAa3xA»M cfccriTGAAC aacctegagc tractcctgg 500 

AAAfflftGAtf ttfcGGlX3AAA GAACCTTG^A TTACAGTGAA GftTGOCTACA 550, 

Aaa&Mata t&xxiffick MMgotg AATGrcrrrcc atceaacaca 600 

0 CaAftdSATGl? CT^CITAGA CAGCATCCAA ATCCdTCAT GGAAGATAGA 650 

CTi^QOGAGA. GGAGAAA1TA AAATITCTCC ACAAICTATT TCAGTTGCAA 700 

AM^CSraGIT AAAGCTT^GAT TIAAQCGGGA TCAAAAAGAA AGAATCTAAG 750 

GWMfiGAAG aSJmSCttC AGGATCAAAA TAATCTTQCT TK3TCCAGCT 800 

'n'HtjliAATr ATCTifttfGrr tAilTiurrr crrjjAcnM! AATimTicr 850 

1 $ caxMftGfrcA ixSxrntAA AiTCXTOCTG TCEAGTAGAA AOCAIAAAAA 900 

CAAAAAATAA AAATGAAAAT AAAATTAAAA TAAAATAAAA TCAAAAAATG 1000 

AAA^AAAAAC ARCAAAAAAT tAAAAAAOGA AAAAOCAAAA AGACCCGAAA 1050 

QQC^CGAAtt? tGGCCAAATT 1X3GGTTTTGT TTITGTITIT TbTlTiTlGT 1100 

I'l'i'IWlTi' TEtenAlT i 4 l'iAlTri!AT TTEA1T1TIA. iTllAlTlTi' 1150 

2 0 AilTMll'iA. 'ii'i'ii'iUrrr ^ObTiurriT tgtiatttta ttatttatia 1200 

AGC&GAACAC ACAGAAAGCA AACTTTAATT AAACACACIT ATTTAAAATT 1250 

TAAdAGACTA AGCAAGGACA AQCftATRAAG ATAAAGAAAG CTTIATATAT 1300 

TlMiAGGCTT TTTTATAAlT TAACTIACAG CTQCTITCAA GCAAGTTCTG 1350 

aAGTriTGC CTGCI'IITIA. ACCCCGAACA TITCATAGAA CTrGTEAAGA 1400 

2 5 GTI*rGACTGT AAlGTTOCAT AGCAACACTC CCTTEAGCAT TAGGATTGCT 1450 

GGAGCEaAGT AlAGCAGCAT ACTCTITOCC CTTCTTCACC TGATCTTCAT 1500 

TCAJftTCAAA TGCTI*GCTT TTCAGCACAG TGCAAACTIT TCCTAAGGCT 1550 

TCCTTGGTGT CATACTiLTi' 'TGGGTOGATC COGAGGTCCT TGTATTTPGC 1600 

A3XXTGATAT AlAGCCAAGA CAACACTGAT CATCTCAAAG CTATCAACTG 1650 

3" 0 AAGCAAlAAG AGGTAAGCTA CCTCOCAGCA TTAIQGCAAG TCTCACAGAC 1700 

ttfdCATCA* CGAGAGGIAA ibCAIAGGCT TGAAICAAAG GATGQGAAGC 1750 

AAlXS^AGAl? l^GA^AJaEAT l^GATTCTC AGAATTCCCA GTTTCTTCAA 1800 

CAAGOCTGAC CCTGATCAAG (MiCAAQCC TTCTGAAGGT CATGTCAGTG 1850 

CXJTCCAATCC TGTCTGAAGT ITIUITIATG GTAATTTIAC CAAAAGTAAA 1900 

3 5 ATCGCfTTGC TTAATAACCT KATTATGCT CTCACGATTC TTTAGGAATG 1950 
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TCAGACATGA AATAAdGOdC ATCITCITGA TCTGGTCGAT GT1T1CCAGA 2000 

CAAAAAGfCT *GAAMGAA $GCTACCAgA TTCTGATCflT (XTCAAACTC 2050 

AAG^ftMxs txtffilfeifcA. AcMAgc&ac AATGCTTTX TTAGTGAGCT 2100 

TAACXJttAGA. CASGASsASid GiftAAAGCTG TtfflAGCEET GACCGTATCT 2150 

AACfKAAGGT GCGAA&G^ AAcSCTGTAT CXX^G^ 'ITTCTD^GGr 2200 

*CMA5G3?G AISgMMA AgACTEAGTG TiAADGfcATG AACACAAAAT 2250 
1GAC&3GATT GCttft 2265 
(2) INFORMATION f=OR §EQ lb NO:6: 

(0 Sequence characteristics: 

(A) LENGTH: 1709 base pairs- 
(8) TYPE: nucleic acicf 

(C) StRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

AAAITCTCTT GCAGTGAAAT CTCTGCTCAT GTTAGCAGAA AACAACATCA 50 

TQCCTAACTC TCAAGCTlTr GTCAAAGCTT CTACTGAITC TAATTTCAAG 100 

CTGAGCCTCT GGCTAAGGGr TCCAAAGGTT TTGAAGCAGA TTTCCATTCA. 150 

GAAAlTGTTC AAQGITXBCAG GAGATGAAAC AAATAAAACA TITIMTIAT 200 

C3ATIGCCTG CATTOCAAAC CAlRACAGTG TTGAGACAGC TTIAAACAIT 250 

AGJ^GTEATlT GCAAGCATCA GCTOCCAAIT CX3EAAATGTA AAACTGCTTT 300 

TGAAiPEATCA ATGAlGTTTT CTGATTEAAA GGAGCCTTAC AACATEAITC 350 

AlfiAlCCtDC ATATCCCCAA AGGATTGITC ATGCTCTGCT TGAAACTCAC 400 

ACATCTTITG CACAAGTTCT TTGCAACAAC TTGGAAGAAG ATGTGATCAT 450 

cmcAccrtG aacaaccatg AGCTAACTCC TQGAAAGTTA GATTCAGGTG 500 

AAAiAActlt gaateAcaat gAAgacgcct acaaaaggaa ATArnccrr 550 

TCAAAAACAC TTGAATCTCT TCCATCEAAC AIACAAACEA TGTCTEATTT 600 

AGACAGCATC CAAATCXXTT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA 650 

TTAAAATCTC TCCACAATCT ATTTCAGTTG CAAAATCnT GTTAAATCTT 700 

GATTfCAAGCG GGATEAAAAA GAAAGAATCT AAGATTAAGG AAGCATATGC 750 

TTCAGGATCA AAA1GATCTT GCTGTGTCCA GCTTTITCIA ATTATGTTAT 800 

bi'^iAiLTllC lTlUL'i'jJfidP TATAAITATT TTTCTGTITG TCAlTltTIT 850 

cAAAfficcrc cigtctag'ia gaaaccataa aaacaaaaat aaaaataaaa . 900 

TAAAATCAAA AlAAAAiAAA MtcaaAaaA tgaAaiaAaa GCAACAAAAA 950 

AAtTAAAAAA CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGGCCA 1000 

AATI*rGGGGT TTCT1TITGT ITITrGTTTT 'ITlUlTmT GITlTiATTT 1050 
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TlAll'H'JLAI TlTlMl'lTl' Ai'lTUAlTlT AIT1TATGTT TTIGTTGTIT 1100 
ttGffl&m GTTATTTATT AAGCACAACA CACAGAAAGCA AACTTTAAT 1150 
TAAkd&ACT TA&AAftAl' TiAAC&SACT AAGCAAGCACA AACAAtAAA 1200 
GAtA^AGAAA QCSClftSRM finaiAGQCT TTTTIAIEAAT T^lAACTTACA 1250 
5 GCSGGTfeA. AGCAA6M? GftSAGITITG raXJITITPr AAGCraAAC 1300 

AiSW&GA A^ErcriMd &M£Acix3 TAAixMfocA tagcaatact 1350 

TOMMA TlAGGMtoG TMGCTfcAG TATAGCAGCA. TAClCTrECC 1400 
(Xftffi^ dEGAlCI^CA TICAnTCAA ATXaCTTTICT TTTCAGCACA 1450 
GTGGMACIT 3 TlttSAAGGC TTCCCTGGTG TCATACTTCT TTGGGTCGAT 1500 

1 0 OXGAQATCC TIGlAITrrG GAICCTGATA TAIAGCCAAG ACAACACTGA 1550 
fcAl&GAAA GCEVTCAACT GAAGCAAIAA GAGGTAAQCT ACCTCCCAGC 1600 
AlTAlGGCAA GCCrCAGAGA CtTTGCATCA TCAAGAGGTA ATCCATAGQC 1650 
TTGAAifGAAA GGGTGGGAAG CAATCTTAGA. TTTGATAGIA TTGAGATTCT 1700 
CAGAAlTCC 1709 

1 5 (2) INFORMATION FOR SEQ ID NO:7: 

(I) SEQUENCE CHARACTERISTICS: 
V (A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 

(>d) SEQUENCE DESCRIPTION: SEQ ID NO:7: 





Gin Val Glu 


Ser Asn Asn Arg Thr Val Asn Ser Leu Ala Val 


Lys 








5 


10 


15 


25 


Ser leu Leu 


Met 


Ser Ala Glu Asn Asn 


He Met Pro Asn Ser 


Gin 








20 


25 


30 




Ala Ser Thr 


Asp 


Ser His Phe Lys Leu 


Ser Leu Trp Leu Arg 


Val 








35 


40 


45 




Pro Lys Val 


Leu 


Lys Gin Val Ser He 


Gin Lys Leu Phe Lys 


Val 


30 






50 


55 


60 




Ala Gly Asp 


Glu 


Thr Asn Lys Thr Phe 


Tyr Leu Ser He Ala 


Cys 








65 


70 


75 




lie Pro Asn 


His 


Asn Ser Val Glu Thr 


Ala Leu Asn He Thr 


Val 








80 


-85 


90 


35 


lie Cys Lys 


His 


Gin Leu Pro lie Arg 


Lys Cys Lys Ala Pro 


Phe 








95 


100 


105 




Glu Leu Ser 


Met 


Met Phe Ser Asp Leu 


Lys Glu Pro Tyr Asn 


ile 








110 


115 


120 




Val His Asp 


Pro 


Set Tyr Pro Lys Gly 


Ser Val Pro Met Leu 


Trp 


40 






125 


130 


135 
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Leii Glu Thr His Thr Set Leu His Lys Phe Phe Ala Thr Asn Leu 

i40 145 150 

Gift Glu Asp Val lie life Tyr Thr Leu Asn Asn Leu Glu Leu Thr 

155 160 165 

5 Pro Gly Lys Leu Asp Leu Gly Glu Arg Thr Leu Asn Tyr Ser Glu 

170 175 • 180 

Asp Ala Tyr Lys Af£ Asp Tyr Phe Leu Ser Lys Thr Leu Glu Cys 

185 190 195 

LSU Pro Ser Asn Thr Glh Thr tet Ser Tyr Leu Asp Ser lie Gin 
10 200 205 210 f 

li& Pro Ser Trp Lys lie Asp Phe Ala Arg Gly Glu lie Lys lie ' 

215 220. , 225 

Ser Pro Gin Ser He Ser Val Ala Lys Ser Leu Leu Lys Leu Asp 

230 235 240 

1 5 Leu Ser Gly He Lys Lys Lys Glu Ser Lys Val Lys Glu Ala Tyr 

245 250 255 

Ala Ser Gly Ser Lys 

260 

(2) INFORMATION FOR SEQ ID NO:8: 

1 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEbNESS: single 

(D) TOPOLOGY: linear 

25 (II) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
TTAACACACT AAGCAAGCAC AAACAATAAA GATAAAGAAA GCTTTATATA 50 
TTTATAQGCT TTTTTATAAT TTAACTIACA GCTGCTITIA AGCAAGTTCT 100 

gtgagttttg ccTurrrm' Aacoccaaac atttcataga acttgttaag 150 
3 0 GGT^TCACTG taatgttcca tagcaatact tocttagca ttaggattgc 200 
tggaqctaag tatagcagca TAcrcrncc ccttcitcac ctgatcttca 250 

TICAITTCAA ATGCTTITCT TTTCAGCACA GTGGAAACTT TTOCTAAGGC 300 

TTOCCTGGTG TCATACrrCT TTGGGTCGAT CCCGAGATCC TTGTATTTTG 350 

CATCCTGATA TATAGOCAAG ACAACACrGA TCATCTCAAA GCTATCAACT 400 

3 5 GAAQGAATAA GAQGTAAGCT ACCTCCCAGC ATTATGGCAA GCCPCACAGA 450 

CTTTGCATCA TCAAGAGGTA ATOCATAQGC TIGACTCAAA GGGTGGGAAG 500 
CAATCTTAGA TrTGATAGTA TTGAGA3TCT CAGAATTCCC AGTTICCTCA -5,50 
ACAAGCCTGA CCCTGATCAA GCTATCAAGC CTTCTGAAGG TGATGTCAGT 600 
GGCTOCAATC CTGTCTGAAG I'lTlUiTiAT GGTAATTTTA CCAAAAGTAA 650 

4 0 AATCGCTTTG CTTAATAAOC TTCATTATGC TCTGACGATT CITCAGGAAT 700 
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gtcagacatg aaataa*gct cATcrrrrrG ivtceqgtgaa. GGrrrrccAG 750 

AGftfc&AAGTC T?fGAMft3A AtfGCEACCAG TCCTCAAACT 800 

ca&3$x^ Gdcf 1 ^^ 850 

TlMfiCAT 858 
5 (&)lNFt}ftMAtl6N^SfedlDN05: 

(i) §£0UIN6£ CHAftACtEftlStlCS: 

(A)L§N8fH: 202§ base pairs 
(6)tV^E: hucibic add 

(OStBANbEDNESS: slnglfc 
10 0)fCmet3V: linear 

(ii) MOLECUL£fYPE: DNA 

(Jd) §EQt)ENG£ bESCftlPTION: SEQ ID NOS: 

AAAf^CTCltf GCAGTSAAA? CTCTGCTCAT GTTAQCAGAA AACAACAICA 50 

$GC£&ACtC TGAAGCTrtS GTCAAAGCTT CTACTGATTC TAATTTCAAG 100 

1 5 CTQAGCCTCr GGCTAAGGGT TCCAAAGGTT TK3AAGCAGA TTTOATTCA 150 

GAAAffiGTiC AAGGTTGCAG GAGATGAAAC AAAIAAAACA TTITATTTAT 200 

cl&FfeocTG cATTxrAAAd cataacagtg ttgagacagc tteaaacatt 250 

ACTGttArrr gcaaqcatcA gctcocaatt cgtaaatgia aaactccttt 300 

TGAAtffcATCA AlteAluMT CTGA3TERAA GGA.GOCTTA.C AACATTATTC 350 

10 ATCATCCITC ATATOCCCAA AGGATTGTTC ATGCTCTGCT TGAAACTCAC 400 

AGAferriTG CAGAAGTTCt TTQCAACAAC TTCCAAGAAG ATGTGATCAT 450 

CTACACCTTG AACAACCATG AGCTAACTCC TGGAAAGTEA GRTTIAGGTG 500 

AAATAACTTT GAATTACAAT GAAGACGOCT ACAAAAGGAA ATATTTCCIT 550 

TCAAAAACAC ftfGAATGTCT TOCATCTAAC ATACAAACEA TGTCTTATTT 600 

2 5 AGACAGGATG CAAATCOCIT CCTGGAAGAT AGACTTTGOC AGGGGAGAAA 650 

TTAAAAiTTC TOCACAATCT ATTTCAGTTG CAAAATCTTT GTTAAATCTT 700 

GATITAAGCG GGATTAAAAA GAAAGAATCT AAGATIAAGG AAGCATATGC 750 

TTCAGGATCA AAATGATCTT GCTGTGTOCA GCTITITCEA A1TATGTTAT 800 

GTTEATTTTC 'ITiUlTiACT TATAATTATT TTTCTGrTTC TGAITILTIT 850 

3 0 CAAATTOCTC CKTCTAGTA GAAAOCAIAA AAACAAAAAT AAAAATAAAA 900 

TAAAATCAAA ATAAAATAAA AATCAAAAAA TGAAATAAAA GCAACAAAAA 950 

AAttAAAAAA CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGGOCA 1000 

AATftGGGGT m?iTlTlGT iTlllUlTlT TITGTTTTTT GT1T1TA1TT ^1050 

TIMTiTiAT 'iTriAl'i'lTl' Al'lTlMTIT ATTTTATGIT TTTCITGTTT 1100 

3 5 TTGTTATTIT GTTATrrATT AAQCACAACA CACAGAAAGC AAACTITAAT 1150 
TAAACACACT TATTTAAAAT TTAACACACT AAGCAAGCAC AAACAATAAA 1200 
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GAIAAAGAAA GUiTlASCAlA iTEATAGGCT ITlTiATAAT ITAACTTACA 1250 

GGTQGJPMA AGCAASlte: GTGAGTTITG CXTbl'lUTiT AJ«X<XAAA.C 1300 

AEftfc&EAGA Mn^fcAAQ GGTPECACTG TAATG1T0CA lEAGCAATACT 1350 

IJOCMaQCA. W3«3GAlEiX3j $GGAGCTAAG TAIAGCAGCA 1400 

5 OC'i/M'lCACS ClyAlbl'lCA. ^fcAirteAA. ATQCJtITKS' ffcltAGCACA 1450 

GC^MACEf ftfcdtiAAGGd 1fftX!C^iX3CTG TX^iACZTTCi? 51GGGTCGAT 1500 

ddOMATCG tfeAWfG GATCCTGAIA TAIAGCCAAG ACAtoCTGA 1550 

!flCA5&C3^ aMKMC* GAAGCAA3AA. GAGGTAAGCT ACCTCCCAGC 1600, 

AlMSGGAA GOCTC&SAGA. CltfttaCATCA. TCAAGAGGTA ATCCAIAGGC 1650 

1 0 ilSAtfid&A GGGTGGGAAG GAATCTTAGA TTTGAiAGTA. I^GAGATDCT 1700 

OVGAAfiXXC A(2mfflCA AGAAGCCIGA COCTGATCAA GCEATCAAGC 1750 

cHOTAAGG *GATG^GASt t GGCTOCAATC CTGTCTGAAG ITITLTITA T 1800 

GGTAAfilTEA (X&AAAGffiAA AAKX3CTTTG CTIAATAACC TTCATTATCC 1850 

IX^GAdsAl* CffiCAGGAAT GTtAGACATG AAAIAATGCT CATCTJLTi'lG 1900 

1 5 ATGfG&CAA GSmTOCAG ACAAAAAGTC TTCAAGTP3A ATGCTACCAG 1950 

A5ffi(^ATCT TOCR^AACi? CAAGGTCTTT GOCTTGTGTC AACAAAGCAA 2000 

cAA3ecMc driAGtosAGC Itaadcat 2028 
(2) INP&ftMATION FOR SEG ID NO:10: 

(i) S£QUENc£ CHARACTERISTICS: 

20 (A) LENGTH: 22 base pairs 

(BJtYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

2 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

1TCTGGTCTT CfTCAAACT CA 22 

(2) INFORMATION FOR SEQ ID NO:1 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 
•3 0 (B)TYP£: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ki) SEQUENCE DESCRIPTION: SEQ ID NO:1 1 : 

3 5 CTGmQOCAT GAGCAAAG 18 

(2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 467 amino acids 

(B) fYP£: ariliho acid 
(6)S1flAN5£bNESS: single 
<B)1tma8V2 linear 

(IOMOLECUL£fYP& peptide 

(*i) SEQUENT bESGftlPTION: SEQ ID N0:12: 



Mat S£r 


Ser 


Gly 


val 


Tyt 


Gltl 


Ser 


He 


He 


Gin Thr Lys Ala Ser 








5 










10 


15 


Val Trp 


Gly 


Ser 


Thr 


Ala 


Ser 


Gly 


Lys 


Ser 


He Val Asp Ser Tyr 








20 










25 


30 


Trp lie 


Tyr 


Glu 


Phe 


Pro 


Thr 


Gly 


Ser 


Pro 


leu, Val Gin Thr Gin 








35 










40 


45 


Leu Tyr 


Ser 


Asp 


Ser 


Arg 


Ser 


Lys 


Ser 


Ser 


Phe Gly Tyr Thr Ser 








50 










55 


60 


Lys He 


Gly 


Asp 


He 


Pro 


Ala 


Val 


Glu 


Glu 


Glu He Leu Ser Gin 
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70 


75 


Asn Vdl 


His 


He 


Pro 


Val 


Phe 


Asp 


Asp 


He 


Asp Phe Ser He Asn 


He Ash 






80 










85 


90 


Asp 


Ser 


Phe 


Leu 


Ala 


He 


Ser 


Val 


Cys Ser Asn Thr Val 


Asn Thr 






95 










100 


105 


Asn 


Gly 


Val 


Lys 


His 


Gin 


Gly 


His 


Leu Lys Val Leu Ser 


Leu Ala 






110 










115 


120 


Gin 


Leu 


His 


Pro 


Phe 


Glu 


Pro 


Val 


Mat Ser Arg Ser Glu 


He Ala 






125 










130 


135 


Ser 


Arg 


Phe 


Arg 


Leu 


Gin 


Glu 


Glu 


Asp He He Pro Asp 








140 










145 


150 


Asp Lys 


Tyr 


He 


Set 


Ala 


Ala 


Asn 


Lys 


Gly 


Ser Leu Ser Cys Val 


Lys Glti 






155 










160 


165 


Ells 


Thr 


Tyt 


Lys 


Val 


Glu 


Mat 


Ser 


His Asn Gin Ala Leu 








170 










175 


180 


Gly Lys 


Val 


Asn 


Val 


Leu 


Ser 


Pro 


Asn 


Arg 


Asn Val His Glu Trp 








185 










190 


195 


Leu Tyr 


Ser 


Phe 


Lys 


Pro 


Asn 


Glu 


Asn 


Gin 


He Glu Ser Asn Asn 








200 










205 


210 


Arg Thr 


Val 


Asn 


Ser 


Leu 


Ala 


Val 


Lys 


Ser 


Leu Leu Met Ala Thr 


Glu Asn 






215 








220 


225 


Asn 


He 


tet 


Pro 


Asn 


Ser 


Gin 


Ala 


Phe Val Lys Ala Ser' 


Thr Asp 






230 










235 


240 


Ser 


His 


Phe 


Lys 


Leu 


Ser 


Leu 


Gin 


Leu Arg He Pro Lys 


Val Leu 






245 










250 


255 


Lys 


Gin 


lie 


Ala 


He 


Gin 


Lys 


Leu 


Phe Lys Phe Ala Gly 


Asp Glu 






260 










265 


270 


Thr 


Gly 


Lys 


Ser 


Phe 


Tyr 


Leu 


Ser 


He Ala Cys He Pro 








275 










280 


285 



Asn His Asn Ser Val Glu Thr Ala Leu Asn Val Thr Val He Cys 

290 295 300 

Arg His Girl Leu Pro lie Pro Lys Ser Lys Ala Pro Phe Glu Leu 

305 310 315 

5 Ser Mat lie Phe Ser Asp Leu Lys Glu Pro Tyr Asn Thr Val His ' 

320 325 330 

Asp Pro Set Tyr Pro Gin Arg He Val HiS Ala Leii Leu Glu Thr 

335 340 345 

His Thr Ser Phe Ala Gin Val Leu Cys Asn Lys Leu Gin Glu Asp 
10 350 355 360 

Val lie lie Tyr Thr He Asn Ser Pro Glu Leu Thr Pro Ala Lys 

365 370 , 375 

Leu Asp Leu Gly GlU Arg Thr Leu Asn Tyr Ser Glu Asp Ala Ser 

380 385 390 

1 5 Lys Lys Lys Tyr Phe Leu Ser Lys Thr Leu Glu Cys Leu Pro Val 

395 400 405 

Asn Val Gin Thr Met Ser Tyr Leu Asp Ser He Gin He Pro Ser 

410 415 420 

Trp Lys He Asp Phe Ala Arg Gly Glu He Arg He Ser Pro Gin 

2 0 425 430 435 
). Ser Thr Pro He Ala Arg Ser Leu Leu Lys Leu Asp Leu Ser Lys 

440 445 450 

He Lys Glu Lys Lys Ser Leu Thr Trp Glu Thr Ser Ser Tyr Asp 
455 460 465 

25 Leu Glu; 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

3 0 (C)STRANDEDNESS: single 

(D)TOPOLOQY: linear 

(il) MOLECULE TYRE: peptide 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Met S 

35 



Met 


Ser Lys Val Lys 


Leu Thr 


Lys 


Glu Asn 


He Val Ser Leu Leu 




5 






10 


15 


Thr 


Gin Ser Ala Asp 


Val Glu 


Phe 


Glu Glu 


Asp Gin Asn Gin Val 


Ala 


20 






•25 


30 


Phe Asn Phe Lys 


Thr Phe 


Cys 


Gin Glu 


Asn Leu Asp Leu He 




35 






40 


45 


Lys 


Lys Met Ser He 


Thr Ser 


Cys 


Leu Thr 


Phe Leu Lys Asn Arg 




50 






55 


60 


Gin 


Gly He Met Lys 


Val Val 


Asn 


Gin Ser 


Asp Phe Thr Phe Gly 




65 






70 


75 
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LyS Val Thr lie Lys Asn Ser Glu Arg Val Gly Ma Lys Asp 

80 85 90 

Met Thr Phe Arg Arg IfeU Asp Ser Met He Arg Val Lys Leu He 

95 100 105 

Gltl Glu Thr Ala Ash Ash Glu Asn Leu Ala lie He Lys Ala Lys 

110 115 120 

116 Ala Ser Ms Pro IMi Val Gin Ala Tyr Gly Leu Pro Leu Ala 

125 130 135 

Asp Ala Lys Ser Val Arg Leu Ala He Met Leu Gly Gly Ser He 

140 145 150 , 

Pro Leu He Ala Ser Val Asp Ser Phe Glu Mat He Ser Val Val 

155 160 . 165 

Leu Ala He Tyr Gin Asp Ala Lys Tyr Lys Glu Leu Gly He Glu 

170 * 175 180 

Pro Thr Lys Tyr Asn Thr Lys Glu Ala Leu Gly Lys Val Cys Thr 

185 190 195 

Val Leu Lys Ser Lys Gly Phe Thr Met Asp Asp Ala Gin He Asn 

200 205 210 

LyS Gly Lys Glu Tyr Ala Lys lie Leu Ser Ser Cys Asn Pro Asn 

215 220 225 

Ala Lys Gly Ser He Ala Met Asp Tyr Tyr Ser Asp Asn Leu Asp 

230 235 240 

LyS Phe Tyr Glu Met Phe Gly Val Lys Lys Glu Ala Lys He Ala 

245 250 255 

Gly Val Ala 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEbNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DAN 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

AGAGCAATTG GGTCATTTTT TATTCTAAAT CGAACCTCAA CTAGCAAATC 50 

TCAGAACTGT AATAAGCAcA AGAGCACAAG AGCCACAATG TCATCAGGTG 100 

TTTATGAATC GATCATTCAG ACAAAGGCTT CAGTTTGGGG ATCGACAGCA 150 

TCTGGTAAGT cgatcgtggA TTCTTACTGG ATITATGAGT TTCCMCTGG 200 

TIdrCCACTG GTTCAAACTC AGTTGTACTC TGATTCGAGG AGCAAAAGTA 250 

<3CTr33GCTA CACITCAAaA ATTGGTGATA TICCTGCTGT AGAGGAGGAA 300 

A1TTTATCTC AGAACGTTCA TATCCCAGTG TTTGATGATA TTGATTTCAG 350 

CATCAATATC AATGA1TCTT TCTTGGCAAT TTCTGITTGT TCCAACACAG 400 



ifrAACACCAA TGGAGTGAAG CATCAGGGIC ATCTEAAAGT TCTITCrCTT 450 

GGdoMfec AicccftftGA acctgtgatg agcaggtcag AGATTGCTAG 500 

C^feAffOOGG CttiCAAGAAG A&BAIATAAT TCCTGATGAC AAAIATA1AT 550 

GTft^&AA GAAGG3Mt?i? MtC33GTG TCAAAGAACA $ACTEACAAA 600 

$ GrGMft& o&kMm GGt*nMdGC Aaag%aatg ftmrcrce 650 

iftAMftAAT GBfcaiiJGAG^ <&3C?jtfGlMAG tnXBAACCA AATTTCAACe 700 

AG&IES§AAAG taataagaga MtfGlfcAftrr CTCTtGCAGT CAAKTCTTTG 750 

cMa^GCTA CftGAAAAdAA CA33ATGCCT AACTUKIAAG CTITIGTTAA. 800 

AGMg£AGf GAtefCffi ^KMGTTGAG CCTTTGGCTG AGAA3TCCAA 850' 

1 0 AAGfMGAA GCAAAi£AGCC AiACAGAAGC TCTTCAAGIT TGCAGGAGAC 900 

GAAAG^GGTA AAAGTftCTA ttTGTCTAIT GCATGCATCC CAAATCACAA 950 

CAGli$t5GGAA AC^QCTTTAA. ATGTCACTGT TAIATGTAGA CATCAGCTTC 1000 

CAATCGCEAA GTCCAAAGCT CCTTTTGAAT TATCAATGAT TTTCTCCGAT 1050 

C1GAAAGAGC CTIACAAGAG IGTGCATGAT CCTTCATATC CTCAAAGGAT 1100 

1 5 TGttXMsCT TTGCtflGAGA CTCACACTTC CTTTGCACAA GTTCTCTGCA 1150 

ACAAGGTGCA AGAAGATGTG ATCATATATA CTATAAACAG OOCTGAACTA 1200 

1 ; ACCCCAGCTA AGCTGGATCT AQGTGAAAGA ACCTTGAACT ACAGTGAAGA 1250 

TGC£i£GAAG AAGAAGTATT iTCTTTCAAA AACACTCGAA TGCTTGCCAG 1300 

lAAMSiGCA GACIATGTCT TAT1TX3GATA GCATCCAGAT TOCTTCATGG 1350 

2 0 AAGA^AGACT .ITGCCAGAGG AGAGATCAGA ATCTOCCCTC AATCTACTCC 1400 

TAftiGGAAGA TCTTTSCTCA AGCTGGATIT GAGCAAGATC AAGGAAAAGA 1450 

AGTCXjftdGAC TTGGGAAACA TCGAGCTATG ATCTAGAATA AAAGIGGCTC 1500 

ATAC^CTCT AAGTAGTAIT TGTCAACTTG CTEATCCTTT ATGTTGTTEA 1550 

TT^MTAA ATCTAAAGTA AGTTAGATTC AAGTAGTTTA GIATGCTATA 1600 

2 5 GCAi'iAi'iAC AAAAAATACA AAAAAATACA AAAAAATACA AAAAATATAA 1650 

AAAAOCCAAA AAGATCCCAA AAGGGACGAT TIGGTTGA1T TACTCTGTTT 1700 

tAGGC®CATC TAAGCEGCTT TTGTTTGAGC AAAATAACAT TGTAACATGC 1750 

AATAACTGGA ATlTAAAGTC CTAAAAGMG TTTCAAAGGA CAGCTTAGCC 1800 

AAAAMX3GTT ITiUri'l'l'itS 'mTiTMT T1T1U1T1T1' TTGTTTTATT 1850 

3 0 iiWixriA GTntoiTT UumTlUlT AT1TTIATIT TTATTTTATT 1900 

TiulttaTT TTATTTATAT ATATATCAAA CACAATOCAC ACAAAIAATT 1950 

TTAATTtCAA ACATTCTACT GATTTAACAC ACTTAGCCTG ACTITATCAC 2000 

ACTZAACACG CTlAGITAGG CTTTAACACA CTGAACTGAA TTAAAACACA 2(350 

CTTAG'iAriA TQCATCTCTT AATTAACACA CTITAATAAT ATGCATCTCT 2100 

3 5 GAATCAGCCT TAAAGAAGCT TTEATGCAAC ACCAQCAATC TTCGCCTCTT 2150 
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TdflMCTCC AAACATTTCA TAGAATITGT CAAGATTATC ACTX3TAATAG 2200 - 
TC&S&3CAA TGCTTCCefcT' AQCAiTGQtSl TIGCAAGAAC TAAGTATCTT 2250 

gggMMcF tIxxiotIM! t3M£tgtgo atcMxAet gtaaatcctt 2300 

•ixacMfflMG cAcMScaa acxSMxea GAGctrcciT agtcttgtac 2350 

5 TTASMsrf caatcccMa g%3ctMjac tttgcstc£t 6Sirtatqgc 2400 

imm&ch dfeAte^ci! cMagcS&ec AacAgaagca atgagaggga 2450 

taSEag^tcc AagcMMA gc&AgiEtca cAGATiffiGC atctgccaga 2500 

GQGA&XCX3T AAGOTGGA6 GAAAGGGTGG GRGQCMtTT TTCCTTTGRT 2550 

AATMjAAGA. StocffiGf TTGCAGriCTC TTCTATGAGC TTCACTCTTA 2600 

1 0 TCATGC1ATC AAGCCTCCTG AAAGTCAIAT CCTTAGCTCC 'aACTCTITCA. 2650 

GAAi'HllCT TTATCdTGAC CT&OCAAAA GTAAAATCAC TTTGGTTCAC 2700 

AACffiCATA ATGCCTTGGC GATTdTFCAA GAAAGTCAAA CATGAAGTGA 2750 

TAC^TTTT CTTAATCAGG TCMGATTTT CCTGACAGAA AGTCTEAAAG 2800 

£iEAATGCGA CtTCGT^CTG GTCTlllUCA AACTCAACAT CTGCAGATTG 2850 

1 5 AGTTAAAAgA GAGACAATGT ITILTITIGT GAGCTTGACC TTAGACATGG 2900 

TOGCAGteA GATCTAGACC TTTCTCGAGA GATAAGATTC AAGGTGAGAA 2950 

AGTv&AACAC TGTAGACCGG GGTCGTTACT TATCCTGTTA ATGTGATGAT 3000 

TTtM^QCT GAGTATTAGG TTTTTGAATA AAATTGACAC AATTGCTCT 3049 
(2) INfcbRMATlbN PGR Sfed Iti N0:15: 

20 (I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: Nucleic acid 

(C) STttANDEDNESS: single 

(D) T0^CX3Y: linear 

25 (li) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

ATG CAA CAC CAG CAA. TCT TGG CCT CTT TCT TAA CTC CAA 39 

ACA tTT CAT AGA ATT TGT CAA GAT TAT CAC TGT AAT AGT 78 

CCA TAG CAA TGC TTC CCT TAG CAT TGG GAT TGC AAG AAC 117 

3 0 TAA GTA TCT TGG CAT ATT CTT TCC CTT TGT TTA TCT GTG 156 

CAT CAT CCA TTG TAA ATC CTT TGC TTT TAA GCA CTG TGC 195 

AAA CCT TCC CCA GAG CTT CCT TAG TGT TGT ACT TAG TTG 234 

GTT CAA TCC CTA ACT CCT TGT ACT TTG CAT CTT GAT ATA 273 x . 

TGG CAA GAA CAA CAC TGA. TCA TCT CGA AGC TGT CAA CAG 312 

3 5 AAG CAA TGA GAG GGA TACl TAC CTC CAA GCA TTA TAG CAA 351 

GTC TCA CAG ATT TTG CAT CTG CCA GAG GCA GCC CGT AAG 390 
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. CTT GGA CCA. AAG GGT GQ3 AGG CM TTT TTG CTT TGA TAA 429 

TAG GAA GAT TCT CAT' TGT TTG CAG TCT CTT CTA TGA GCT 468 

TCA GIG TEA. TCA TGC- TAT CAA GCC TCC TGA AAG TCA TAT 507 

CCf ¥AQ CTC CAA Cf(3 ffT CAG AAT TTT TCT TTA TCG TGA 546 

5 CCT TAG" CAA AAG TAA. MS? CAC TTT GGT TCA CAA CTT TCA 585 

TAA. TGCl CTT GGC GAT* TG^f TCA AGA AAG TCA AAC ATG AAG 624 

TGA. TAG TCA TTT TCT TAA TCA GGT CAA GAT TTT OCT GAC 663 

AGA AAG TCT TAA AGT TGA. ATG CGA CCT GGT TCT GGT CTT 702 

CTT GAA. ACT CAA CAf CTG CAG ATT GAG TEA AAA GAG AGA 741 
1 0 CAA TGT TTT CTT TTG TGA GCT TGA CCT TAG ACA TGG 778 
(2) INFORMATION FC»A SEd lb N0:16: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENQTH: 18 base pairs 

(B) TYPfe: Nucleic acid 
15 (C)STttANbfebNESS: Single 

(DJTOPdC&Y: linear 

(li) MOLECULE TYPE: DNA 
(xl) SEQUENGE bESCRIPTION: SEQ ID NO:1 6: 
GrTCTGAGAT TTGCTAGT 18 
2 0 (2) INFORMATION FOR slo" lb N0:17: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANbEbNESS: single 
25 (DJTOPoLC&Y: linear 

(ii) Molecule Type-. dna 

(Xi) SEQUENCE bESCRIPTION: SEQ ID NO:17: 
TTATAXCTTC TTCTTGGA 18 
(2) INFORMATION FOR SEQ ID NO:18: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEbNESS: single 

(D) TORdLOGV: linear 

3 5 00 Molecule TYPE: dna 

(xi) SEQUENCE bESCRIPTION: SEQ ID N0:18: 
ATG TCA TCA GGT GTf TAT GAA TOG ATC ATT CAG ACA AAG 39 
GCT TCA GTT TGG GGA TCG ACA GCA TCT GGT AAG TCC ATC 78 
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GIG GAT TCT 


TAC 


TGG 


ATT 


CCA 61X3 GIT 


CAA 


ACT 


CAG 


MA. Agt agc 


TTC 


GGC 


TAC 


ccr got gta 
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GAG 


GAA 


AlC GTG 


TTT 


GAT 


GAT 


AAf 6at tct 


TlC 


TTG 
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Gif AAg Ace 


AAT 
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GTT 
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m A& AGG 
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GTG AAT GTT 


CTT 


TCT 


OCT 


CTG TAT AGT' 


TTC 


AAA 


CCA 


AAT AAC AGA 


ACT 


GTA 


AAT 
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TTC 
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TIC 
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CAC AAC AGT 
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GAA 


ACA 


TGT AGA CAT 


CAG 


CTT 
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TTT GAA TTA 


TCA 


ATG 


ATT 


TAG AAC ACT 


GTG 


CAT 


GAT 


GTT CAT OCT 


TTG 


CTT 


GAG 


GTT CTC TGC 


AAC 


AAG 


CTG 


ACT ATA AAC 


AGO 


OCT 


GAA 


CTA GGT GAA 


AGA 


ACC 


TTG 


AAG AAG AAG 
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TTT 


CTT 
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CTG 


GAT 
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TCC TTG ACT 


TGG 


GAA 


ACA 
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TTT 
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TCT 


GAT 


TOG 


AGG AGC 
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TCA 


AAA 


ATT 


GGT 


G&T ATT 
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TCT 


CAG 


AAC 


GTT CAT 
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GAT 


TTC 


AGC 
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j&T ATT 


£. 1 Z) 
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TCT 


GTT 


TGT 


TCP 


AAP APA 


OXZ 


Aag 


CAT 


CAG 


GGT 


PAT 


PTT AftA 


OOX 


TTG 


CAT 


rrr 


TTT 1 

XX X 


HAA 


PPT f*TT 


oyu 


GCT 


AGC 


AGA 


TUP 
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G5VT 


rap 


AAA 


TAT 




i 

X\>X oUI 


468 


TCC 


TGT 


GTC 


AAA 


f5AA 


Pat 7\r*r 


cm 
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AAT 


CAG 
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TTA 

X Xrl 


Q^P AAA 


OHX> 


AAC 


AGA 
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PAT 


f3AQ TPC 
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ATP 
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CTT 
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TPT M T IY > 
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AAC 


ATT 
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CTT 


AAP 
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f\JZ 
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CAT 
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AAA 


GTT 


TTG 
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TTT 


GCA 


GGA 
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TCT 
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ATT 


OCA AAT 


858 
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TTA 


AAT 


GTC 
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GTT ATA 


897 


ATC 


CCT 


AAG 


TCC 


AAA 


GCT CCT 


936 


TTC 


TCC 


GAT 


CTG 


AAA 


GAG CCT 


975 
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TCA 
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AGG ATT 


1014 
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ATA TAT 1092 
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TAP 
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P7A7V 
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GAA 
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ATG 
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AGC ATC 1248 
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AAG AAG 1365 
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TAT 


GAT 


CTA 


GAA 1401 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: Nucleic acid 

(C) SfRANDETJNESS: single 
5 (D)1t3P6Le§V: llh§ar 

(li) MOLECULE WE: bNA 

SEQUENCE t3g§ORlPTlON: SEQ lb NO:1S: 

ATG TCT? AAG GTC AAG &C ACA AAA GAA AAC ATT GTC TCT CTT TEA 45 

ACT CAA TCT GCA GAtT GTT GAG TIT GM GAA GAC CAG AAC CAG GTC '90 

10 GCA TTC AAC TTT AAG ACT TEC TGT CAG GAA AAT CTT GAC CTG ATT 135 

AAG AAA ATG AGT ATC AtiT TCA TGT TTG ACT TTC TTG AAG AAT CGC 180 

CAA GGC ATT ATG AAA GTT GTG AAC CAA AGT GAT TTT ACT TIT GGT 225 

AAG GTC ACG ATA AAG AAA AAT TCT GAA AGA GTT GGA GCT AAG GAT 270 

ATG ACT TTC AGG AGG CTT GAT AGC ATG ATA AGA GTG AAG CTC ATA 315 

15 GAA GAG ACT GCA AAG AAT GAG AAT CTT GCT ATT ATC AAA GCA AAA 360 

ATT GCC TCC CAC CCT TTG GTC CAA GCT TAC GGG CTG CCT CTG GCA 405 

GAT GCA AAA TCT GTG AGA CTT GCT ATA ATG CTT GGA GGT AGT ATC 450 

CCT CTC ATT GCT TCT GTT GAC AGC TTC GAG ATG ATC AGT GTT GTT 495 

CTT GCC ATA TAT CAA GAT GCA AAG TAC AAG GAG TTA GGG ATT GAA 540 

20 CCA ACT AAG TAC AAC ACT AAG GAA GCT CTG GGG AAG GTT TGC ACA 585 

GTG CTT AAA AGC AAA GGA TTT ACA ATG GAT GAT GCA CAG ATA AAC 630 

AAA GGG AAA GAA TAT GCC AAG ATA CTT AGT TCT TGC AAT CCC AAT 675 

GCT AAG GGA AGC ATT GCT ATG GAC TAT TAC AGT GAT AAT CTT GAC 720 

AAA TTC TAT GAA ATG TTT GGA GTT AAG AAA GAG GCC AAG ATT GCT 765 

25 GGT GTT OCA TAA 777 

(2) INFORMATION FOR SEQ ID NO20: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: Nucleic acid 
3 0 (Cj STRANDEDNESS: single 

(D)TC£dOG\>: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO20: 

TACTTATCTA GAACCATQGA CAAAGCAAAG ATTACCAAGG 40 

3 5 (2) INFORMATION FOR SEQ ID NO:21 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 
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(B) TYPE: Nucleic acid 

(C) STRANbEDNESS: single 

(D) tO!^OGY: linear 

(10 MOLECULE TYPE: DMA 
5 W§eaU&N6ib£§6ttl^tlON:SEQiDN02l: . 

^CAG^KSGAtf CGAlfeGTEAT TICAAAIAAT TEAIAAAAGC AC 42 

(2) INFORMATION FOR SEQ Id N022: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH! 36 base pairs 
10 (B)TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPoLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N022: 
1 5 AocarK3GAT ccatggtiaa CACACTAAGC AAGCAC 36 

(2) INFORMATION FOR SEQ ID N023: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: Nucleic acid 
20 (C) STRANDEbNESS: single 

PJTOPOLOGV: linear 

(iO MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N023: 
AGCTAATCTA GAACCATGGA TGACTCA.CTA. AGGAAAGCAT TGTTGC 46 

2 5 (2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

3 0 (D) TOPOLOGY: linear 

(Ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N024: 
CCCACTATCC TTCGCAAGAC CC 22 
(2) INFORMATION FOR SEQ lb NO:25: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

(D) TOP0LOGV: linear 
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(Ii) MOLECULE TYPE: DMA 

(xt) SEQUENCE &ESG&IPTION: SEQ ID N025: 

TBCAi&GG&r OCATGGrEAA GGIAATOCAT AGGCTTGAC 39 

(2) INFORMATION Pm SEQ lb N026: 

5 (I) SEQUENCE CHAfUCTEfilSilCS: 

(A) LENGtH: 40 base pairs 

(B) TYr*& Nucleic acid 

(C) STRANDEbNESS: single 
p)1UPbLOGY: linear 

10 (ii) MOLECULE TYPEi DNA 

(Xi) SEQUENCE bESCtilPTION: SEQ ID N026: v 

AGCTAACCAT GGfrS&GCTC ACTAAGGAAA GCATIGTTGC 40 

(2) INFORMATION FOR SEQ lb N027: 

(i) SEQUENCE CHARACTERISTICS: 

1 5 (A) LENGTH: 46 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA. 

2 0 (*i) SEQUENCE bESCRIPTION: SEQ ID N027: 

AQC1AATCTA. GAACCAIGGA. TGRCTCACm AGGAAAGCAT TGTTGC 46 
(2) INFORMATION FOR SEQ ID N0:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N028: 

3 0 AGCATTGGAT OCATGGTIM CACA.CTAAGC AAGCAC 36 

(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic acid 
3 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

TACAGTTCTA gaaccatgga TGATGCAAAG TCTGTGAGG 39 
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(2) INf=ORMAT!ON FOB §Et2 lb NO:30: 

(0 SEQUENCE 5HABAOtEfilSTICS: 

(A) LENStM: 49 base pairs 
(&)tYPfe Nucleic acid. 

5 (d)StflANbEbNESS: single 

(D)Td%Ld&Y: linear 

0l)M6L£fcUL£Wfe DNA 

(*l) S£aUENfc£ bE§6RIPTION: SEQ ID NO:30: 

AG&ftGftM q^ccAtgctg ACTTGATGAG CftAAGTCTGT GAGGCTTGC ,49 
1 0 thus While W8 hav@ illustrated and described the preferred 

embodiments of our invention, it is to be Understood that this invention 
is capable bf variation grid modification, and We therefore do not wish 
td be limited td the precise terms set forth, but desire to avail 
ourselves bf such changes and alterations which may be made for 
1 5 adapting th§ inventidh td various usages and conditions. Such 
variations and modifications, for example, would include the 
substitution of structurally similar nucleic acid sequences in which the 
difference between th6 sequence shown and the variation sequence is 
suofi that little if arty advantages are available With the variation 
20 Sequence, i.e. that the sequences produce substantially similar results 
as described above, thus, changes in sequence by the substitution, 
deletion, insertion or addition of nucleotides (in the nucleotide 
sequences) or amino acids (in the peptide sequences) which do not 
substantially alter the function of those sequences specifically 

2 5 described above are deemed to be within the scope of the present 

invention. In addition, It is our intention that the present invention may 
be modified to join the N genes of various isolates that provide 
resistance or Immunity to Tospovirus infection of plants according to 
the present invention into a single cassette, and to use this cassette as 
30 a transgene In order to provide broad resistance to the Tospoviruses, 
especially td tSWV-BL t tSWV-B, and INSV. Accordingly, such changes 
and alterations are properly Intended to be within the full range of 
equivalents, afid therefore within the purview of the following ^claims. 
Having thus described bur invention and the manner and a process 

3 5 of makihg and Using it In such full, clear, concise and exact terms so as 
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to enable any person skilled in the art to which it pertains, or with 
whicii it is most nearly connected, to make and use the 'same; 
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We Claim: 

1. An isolated nucleotide sequence Which is Selected from the 

&Pt&fgfiii$ GGte ffljAEg $etgctcat gttagcagaa Mcaacatca 50 

lx3@^Adrd §fGftftAocrr CTAdiGATTc ^AArrrcAAG 100 

c?i?@K§§SRj3? G^dlMQil^ t^JGaaagstt ttgaaqcaga tttccattca iso 

(^1^^^ M^OTQ<^S ^^itelA^c AMift^G^ fri^ife 200 

c'M^teiS gAMgaaag 1 (^McAfffG ttgWcAgC t13aa&catt 250 

AtJMlpAffi 08a^g&$ca fficffi cgeAAMxjE^ MAcrccrrr 300 

fi^^MlA AS^QEDtt Oi\MU J iMA GGAGCcfiAc McattAttc 350 ' 

ffiffi^A j^GATTGT"rc atgctctgc¥ tgaaactcac 4oo 

AGMOTtTQ CACA&lfficr* TTGCAACAAC TTGCAAGAAG ATGTGATCAT 450 

criMMSs M^M MMAcrce tggaaagtta gatttaggtc Soo 
M^^ateff oSMtiAiSAM? gAAGaogcct acaaaaggaA Atatttcctt 550 

TCAAfcAACAC IfSAAfflcf T<XATCTAAC ATACAAACTA TGTC1TAT1T 600 

AGAcMdArd cftAATtixfrt cctggaagat agactitqcc aggggagaaa 650 
riAAAArrrc TcmcAATcf atttgagttg caaaatcttt gttaaatctt 700 

GWIMOOG Q(M!ftAAAA GAAAGAATCT AAGATTAAGG MGCATATGC 750 
TT\jft§jftrdA AAATGAfCTT' GCTGTGTCCA GCT1TITCTA ATTATGTTAT 800 

Gil f M^iTC I'ltefLRdi? tataattAtt titctgtttg tgatttcttt 850 

h O&ATTCCTC CftjttjTAGTA GAAACCAIAA AAACAAAAAT AAAAATAAAA 900 
TAAAATCAAA ATAAAATAAA AATCAAAAAA TGAAATAAAA GCAACAAAAA 950 
AAWAAAAA CA AAftftAft ft AAAAAGATCC CGAAAGGACA ATTTTGGOCA 1000 
AAl fflggG gr ffcfil'I'H^ ll'l^lUlTiT TITGTTTTIT GTTITTATTT 1050 
I'M MM I'Mffii' MllT&TiTi' ATITEATGIT 'lTlUHUm' 1100 

tMJM'M 1 Gi'JAIPrjMr aagcacAaca cacagaaagca aactttaat 1150 

TAAA<&CACT T Ate AAAAf TTAACACACT AAGCAAGCACA AACAATAAA 1200 
GAtfAAAGAAA GOItLATATA TTTATAGGCT TTITTATAAT TTAACTTACA 1250 
GCTG^frlA AGCAAGT^CT GTGAGTTITG CCTGTTTTTT MCCCCAAAC 1300 
AT lgM AGA ACtTGTTAAG GGTTTCACTG TAATGTTCCA TAQCAATACT 1350 
TOCfMOCA TtAGGATTGC TGGAGCTAAG TATAGCAGCA TACTCTTTCC 1400 
CCTTdTTCAC CTGATCTTCA TTCAITTCAA ATGCTTTTCT TITCAGCACA 1450 
GTGCAAACTT TTCCTAAGGC TTCOCTQGTG TCATACTTCT TTGQGTCGAT 1500 
CGCGAGATCC Ita/ft'lllU CATCCTGATA TATAGCCAAG ACAACACTGA 1550 
TCATCTCAAA GCTATCAACT GAAGCAATAA GAGGTAAGCT ACCTCCCAGC 1600 
ATTATGGCAA GCCfCACAGA cWtGCATCA TCAAGAGGTA ATCCATAGGC 1650 
TfiSAATtAAA GGGTGGGAAG CAATCTTAGA TTTGATAGTA TTGAGATTCT 1700 
CAGAATTCC 1709; 

pnteAtea: aagcaa gcac aaacaataaa gataaagaaa gctttatata 50 

rrrATAGGCT rftTTATAAT TTAACTTACA GCTGCITTTA AGCAAGTTCT 100 

GTGAGTTITG Cdfbl'M'lT AACCCCAAAC ATITCATAGA AClTGITAAG 150 

ggtttgIactg taatgtTcca tagcAAtAct TCCTTTAQCA 1*EAGGATTGC 200 

TGC&GCTAAG TATAGCAGCA tACTCTTTCC CCTTCTTCAC CTGATCTTCA 250 

TTCATrrCAA ATGC3TTTCT 'TITCAGCACA GTGCAAACTT TTOCTAAGGC 300 

TICCCTGGTG tCATACTTCT TTGGGTCGAT CCCGAGATCC TTGTATTTTG 350 
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CAtfdGRSA^ ^tAS&GdC&AG AC&ACACTGA TCATCTCAAA QCTAICAACT 400 

GAaee^^t^Ma^ 450 

CJflC^g^Ca. imft^^ liS^ec^EAQQC; tTGACfGRAft. GQCTQGGARG 500 

550 

AGAA^MaA. OMMfM SStA5xAA.dC CTt^EGAAGG fCftlXJFCAGT 600 
G^S^dfMi^ 650 

Gft&£AC&tG m&kPfl&S attflS MXHX3GTCAA. GGTTTiXXXG 750 
ACMAAAgtC ¥tfGAA$3iGA A3X3CEACCAG MTCTGATCT TXX3XAAACT 800 . 
CASMSAf &x3Mt!i£ AACAAAGCAA CAATGCTTTC CHaGTGAGC 850 

AAfiMXiW? GC&GtfQAAAi CTCTGCTCAT GTIAGCAGAA AACAACATCA 50 

TO^ftCTXi ttAAGCSffii? GTPCAAAGCIT CEAC3X3A3TC TAA3TTCAAG 100 

COSAGdCW GG$MGGG7i? TKXAAAGGTT TIGAAGCAGA OTOCRTKA 150 

GAAA9ffiQTi£ AAGGTflGCAd GAGATGAAAC AAATAAAACA TTTrAITEAT 200 

GfiAffldCm GASHXX3AAAd CA3AACAGTG TIGAGACAGC TTTAAACATT 250 

kj MM'M GCAAGC^CA GCTCOCAATT CGTAAATGTA AAACTCCTTT 300 

W&F&tSCk flfiMtf CXGATTEAAA GGAGCCTmC AACATEATTC 350 

AflSA^CttX} Al&ICXXJGAA AGGATTGTTC ATGCTCTGCT TGAAACTCAC 400 

ACAftMfrX* (^OAGl*Cf ¥it3CAACAAC TIGCAAGAAG ATGTGATCAT 450 

C1ACADCTTG AACAACCATG AGCTAACTQC TGGAAAGTIA GATTTAGGTG 500 

AAA^ATTTl? GAAri£C&Af GAAGAGGCCT ACAAAAGGAA ATATTTCCTr 550 

TCAAARACAC i^GAAMfctf IXXATCTAAC ATACAAACTA TGTCTTATTr 600 

AGAG^GCATG CAAAttcfT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA 650 

t*£AAAAta3' IXXAd^CT AffiTCAGTTG CAAAATCTIT GTEAAATCTT 700 

GAll^AAGCG GGATftAAAAA GAAAGAATCT AAGATEAAGG AAGCATATGC 750 

ttXAGGATCA AAA»3A3£1T GCTX3TGT0CA GClTlTlCrA ATTATGTIAT 800 

b'MMC fflt'l ' M CT !EATA7ATEA!TT TTTCTGTTTG TCATTTCTTT 850 

CAAMSCdTC d£flX2AG4A GAAA.CCATAA AAACAAAAAT AAAAATAAAA 900 

tAAAA!fCAAA aIAAAASAAA. AATCAAAAAA TGAAATAAAA GCAACAAAAA 950 

AAT^tAAAAAA CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGG OCA 1000 

AAfflGGGGf tlWl'M'l GT &T1V1T1T TITGTTTriT GTITCTATTT 1050 

I ' iMl'l ' i ' rA T ' M ' lMl'M' A ' riTiAlTIT ATITEATGTT TTTGTrGTTT 1100 

TlUllMi'lT Gl'lAl'l'lA TT AAGCACAACA CACAGAAAGC AAACTTTAAT 1150 

TAAAGACACT TATTlAAAAT ttAACACACT AAGCAAGCAC AAACAATAAA 1200 

GATAAAGAAA GC^TEAIATA ITrATAGQCT TTTTEATAAT TTAACTTACA 1250 

GCTX3C¥frm AQCAAGTiXTT GTGAGTrrTG CCTbTmiT AACCC3CAAAC 1300 

ATff^ASAGA AC^IXS^AAG GGTnEACTG TAATGTTOCA TAGCAATACT 1350 

TXXi^lftAGCA l*ftGGASSGC > TQGAGCEAAG TATAGCAGCA TACTCTITCC 1400 

C CriCniA C CiGATCTFCA TKAriTCAA ATGCTITTCT TITCAGCACA 1450 

GTGdAAAClT rftxaAAGGC TKXXTGGTG TCATACTTCT TTOGGTOGAT 1500 

CCCG&3ATCC ll b 'lAl ' iTi G CATCCTGATA TATAGCCAAG ACAACACTGA 1550 

TtAtt&CAAA GC^IATCAACT GAAGCAATAA GAGGTAAGCT AOCTOCCAGC 1600 

ATTA5B3GCAA GOCTCACAGA CTTTGGATCA TCAAGAGGTA ATCCATAGGC 1650 
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£fGAdfcAAA gggtgggaag caAicrrAGA tttgatagtca ttgagattct 1700 
cwsAArrocc AraffittjOfcA. AcAAgociga occKsATcAA gciatcaagc 1750 

C^ gagG fCA^M? GGCftxAATC CiSft^GAAG 'rmUTl ' JH T 1800 

GG^ftM^M^ <3cAAAA13iAa AAta3dffiTC CTIAMMCC itoOTMXX 1850 

tfgftlAGsAffi g%c&1gAA$ (SfiG^S^CMG Aaata&tgct CAMiTlTG i900 

AfSfGGftAA GSM$6cft© AgAAAAAGTC iTGAAGTDGA AteCTACCAG 1950 

M5»A««^^ 2000 
GAA$§Me mA GmGC frAACCAT 2028,' Stl 

A(&GGAAift3 GG^CAf^Pfl? SAijfttfiAAAT CGAAGCTCAA CTAGCAMTC 50 

TC^BAAtSG 1 !? AA3AAGGAGA AgAGCACAAG AQCXACAATG TCATCAGGTG 100 

ISMgAMC GAlxAflCAG AGAAAGGCrr CAGTTTGQGQ ATCGACAGCA 150 

MGG^AAG^ (XA!KJGl?GGA £ft?I^CTGG ATTTATGAGT TICCAACIGG 200 

ti$J$GAdi£ GlttAAASfc AGftxSACrc tgattcgagg Agcaaaagta 250 

GMSgGCfEA. GAcEKAAAA ASlGGTGAiA TKCTSCTGT AGAGGAGGAA 300 

MTFiJAMC AGAAOGftXA SA5XXXAGTG TTTGATGATA TTGATTTCAG 350 

C&KIAATAIC AA3X3AtelT teETGGCAAT TTCTGTTTGT TCCAACACAG 400 

TlAACAQCAA fl3GAGl?GAAG CAtC&GGGTC ATCmAAGT TCTTrCTCTT 450 

GO^SAtodGC A3XXXS^TGA ACCTGTGATG AGCAQGTCAG AGAITGCEAG 500 

CAGAtfflOCGG drOCAAGAAG AAGATAIAAT TOCTGATGAC AAATATAIAT 550 

CTGC'ltoClM. CAAGGGATCT CTCTCCTGTG TCAAAGAACA TACTTACAAA 600 

GTtteAAATGA. GCCACAATCA GGCTTTAGGC AAAGTCAATG TTCTTTCTCC 650 

TAAGAGAAA!r GtKAfGAG* GGCTGTAIAG TTTCAAACCA AA3TTCAACC 700 

AGAfrOGAAAS ^AAOMgAgA ACSDGTAAATT CTCTTGCAGT CAAA1CTTTG 750 

CteAftSGClA GAGAAAACAA CAiETATGCCT AACTCrcAAG LUlTiUlTAA 800 

AGG^fftnAC^ GAiftC^Mfr fiAAGTTGAG CX^tSgGCTG AGAATTCCAA 850 

AAsffiTGAA &aaa!eaqcc AiAcagaagc TCITGAAGIT TGCAGGAGAC 900 

GAAAOCQGTA AAAGMctA teGTCTATT GCATGCATCC CAAATCACAA 950 

CAGl?GlGGAA ACAGdflTAA ATGTCACTGT TATATGTAGA CATCAGCITC 1000 

caAtocctAA GftxAAAQCT ccrrrrGAAT tatcaatgat tttctccgat 1050 

CTGAAAGAGC CMCAACAC feTGCATGAT CCTTCATATC CTCAAAGGAT 1100 

TGfit&TCCf S*tQc£ft3AGA CICACACTTC CTTTGCACAA. GTTCTCTGCA. 1150 

acaAgctgcA Agaasa4gtg Aieataiata CTAIAAACAG CCCTGAACIA 1200 

AGCCCAGCTA AGCTGGATCT AGGTGAAAGA AOCTTGAACT ACAGTGAAGA 1250 

TGCTtCGAAG AAGAAG^ATl? TK3TTCAAA AACACTCGAA TGCTTGCCAG 1300 

TAAAteTGCA GACTATGTCT 1A3TIGGATA GCATOCAGAT TCCTTCATGG 1350 

AAGATAGACf IftGCCAGAGG AgAGATCAGA ATCTCCCCTC AATCTACTCC 1400 

TAITGGAAGA KJlTtt^JTCA AQCTGGATTT GAGCAAGATC AAGGAAAAGA 1450 

AGMIjftGAC I'tGGGAAACA fCCAGCTATG ATCTAGAATA. AAAGTGGCTC 1500 

ATAc^cfrcf AAsbAGlAa* MtAAcrrG cTTATccrrr atgitcitea i5so 

UlllM'iTAA A^AAAGTA AGTEAGATTC AAG1AGTITA GTATQCTATA 1600 

GCAT3A1TAC AAAAAAmcA AAAAAAIACA AAAAAAIACA AAAAATATAA 1650 

AAAAOCCAAA AAGA3XXXAA AAGGGACGAT TTGGTTGATT TACTCTGTTT 1700 

TAGGdrTATC IAAGCTQCIT 1TGTITGAGC AAAATAACAT TGTAACATGC 1750 

AATAACTGGA ATTTAAAGTC CEAAAAGAAG TTTCAAAGGA CAGCTEAGCC 1800 
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AAAAffiGsrr i i iTbl i H i i'T 6 l'l'i'iTnurr titiuitut ttgttitatt 1850 
gflM'ri ' i'JA bu'rlMTM' M'm^rr MTi'iTMTr ttattteajt 1900 
Mliif ai&iMcaaa (Mmc acaaataatt 1950 

iySfflm AO^ftB^ £AfMA&C ACttAGdC?iX3 ACTTTATCAC 2000 

A^ACACfe C?flA(M^ fcfMMCACA CT&ACTG&A 'TIAAAACACA 2050 

afiftgfc^^GCAl^^ 2100 

GA&fl&^3!A^^ 2150 

fg^f^ AAAC&P^ 2200 

ffiMfc^ fe#iSM¥ A^^ 2250 

dQSMMa? i fK30ffix3¥ tt&fcS&lGC ATCATXX^rr GTAAATCCIT 230d 

TftMrAfo 2350 

•fflffiGGitf GAATddtMA CKX^ftX3EAC TITGCATCIT dATATATGGC 2400 

Aftl&AfcAACA CIX3A*CA^dr CGAAQCTGTC AACAGRAQCA ATGAGAGQGA 2450 

lA£l5&l*Crce AAGCATIATA GCAAGTCTCA CAGATITO3C ATCTQCCAGA 2500 

Q(^GCOOGT AAGCCffelGAC CAAAGGGTGG GAGGCAATTT TTGCTITGAT 2550 

AftiAdCAAGA l^O^TiXlf WQCAGTCrC TTCTATGAGC TTCACTCTEA 2600 

1XM£l^' P^XSeCffi AAAGTCATAT COTAGCICC AACTCTITCA 2650 

GAAlfffiCr TlATdGlmC CI^ADCAAAA G1AAAATCAC TTTGGtrCAC 2700 

AAfflfcATA Att3aM3GC GAttXTTCAA GAAAGTCAAA CATGAAGTGA 2750 

TAlSDSAlM dttCAAtCAGG itAAGATTITT CCTGACAQAA AGTCTTAAAG 2800 

Tilj3ftAlX3CGA CCTGGMdtG GTCTKOTCA AACTCAACAT CTGCAGATTG 2850 

A(2f$AAAAGA GAGACAATGT TTTCTTITGT GAGCTTGACC TIAGACATGG 2900 

TGGGAGWtA GATCMGACd TTKntXSAGA GATAAGATTC AAGGTGAGAA 2950 
AGEGGAACAC 1GIAGA.0CGC GGTCGTEACT TATCCTGTIA ATGTGATGAT 3000 
ri^lMTGCfr GAGTftffAGG l^l'l ' l GAATA AAATTGACAC AATTGCTCT 3049 

2. A plant Susbeptible to infection by Tospovituses which has a 
traftsgene inserted ihtd its genome to render it resistant to infection by 
Tospoviruses, said transgene being selected from the group consisting 
of the nucleoprotein g§he of TSWV-BL, TSWV-toW, INSV-LI, TSWV-B, a 
ToSpOVitUS , said tranche consisting of partial or full length 
nuclebprotein gene sequences from TSWV-BL, TSWV-iOW, TSWv-B, 
iNSV^Beg and INSV-ILj the translatable or untranslatable sequences of 
said hucleoprotein gene sequences, and the sehse or antisense 
sequences of said nuclebprotein gene sequences. 

3. A friethdd tof providing a host plant With resistance to 
infectldn by tospd\rltU§§§ which comprises inserting a transgene into 
the host plant Whlbh g"§h§ Is selected from the nucleoprotein gene of 
fSWV-BL; TSWAM&W, IMSV-Beg, INSV-LI, TSWV-B, or mixtures of 
nucleotide sequences taken from the nucleoprotein gene. 
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FAMILY NAME 
Pang 



FIRST GIVEN NAME 
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CITIZENSHIP 
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statements were made with the knowledge that willful false statements and the like so made 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Gonsalves , Dennis 
Pang , Sheng - Zhi 

(ii) TITLE OF INVENTION: TOMATO SPOTTED WILT VIRUS 

(iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Nixon Peabody LLP 

(B) STREET: Clinton Square, P.O. Box 1051 

(C) CITY: Rochester 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 14603 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/495,484 

(B) FILING DATE: 27-JAN-1994 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Goldman, Michael L. 

(B) REGISTRATION NUMBER: 30,727 

(C) REFERENCE /DOCKET NUMBER: 19603/10303 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (716) 263-1304 

(B) TELEFAX: (716) 263-1600 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



AGCAGGCAAA ACTCGCAGAA CTTGC 



25 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
GCAAGTTCTG CGAGTTTTGC CTGCT 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 
AGCTAACCAT GGTTAAGCTC ACTAAGGAAA GC 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 
AGCATTCCAT GGTTAACACA CTAAGCAAGC AC 



(2) INFORMATION FOR SEQ ID NO : 5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2216 base pairs 

(B) TYPE : nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CAAGTTGAAA GCAACAACAG AACTGTAAAT TCTCTTGCAG TGAAATCTCT GCTCATGTCA 60 

GCAGAAAACA ACATCATGCC TAACTCTCAA GCTTCCACTG ATTCTCATTT CAAGCTGAGC 12 0 

CTCTGGCTAA GGGTTCCAAA GGTTTTGAAG CAGGTTTCCA TTCAGAAATT GTTCAAGGTT 18 0 

GCAGGAGATG AAACAAACAA AACATTTTAT TTATCTATTG CCTGCATTCC AAAC CAT AAC 24 0 

AGTGTTGAGA CAGCTTTAAA CATTACTGTT ATTTGCAAGC ATCAGCTCCC AATTCGCAAA 3 00 

TGCAAAGCTC CTTTTGAATT ATCAATGATG TTTTCTGATT TAAAGGAGCC TTACAACATT 36 0 

GTTCATGACC CTTCATACCC CAAAGGATCG GTTCCAATGC TCTGGCTCGA AACTCACACA 42 0 

TCTTTGCACA AGTTCTTTGC AACTAACTTG CAAGAAGATG TAATCATCTA CACTTTGAAC 48 0 

AACCTTGAGC TAACTCCTGG AAAGTTAGAT TTAGGTGAAA GAAC CTTGAA TTACAGTGAA 54 0 

GATGCCTACA AAAGGAAATA TTTCCTTTCA AAAACACTTG AATGTCTTCC ATC TAACACA 6 00 

CAAACTATGT CTTACTTAGA CAGCATC CAA ATCCCTTCAT GGAAGATAGA CTTTGCCAGA 66 0 

GGAGAAATTA AAATTTCTCC ACAATCTATT TCAGTTGCAA AATCTTTGTT AAAGCTTGAT 72 0 

TTAAGCGGGA TCAAAAAGAA AGAATCTAAG GTTAAGGAAG CGTATGCTTC AGGATCAAAA 780 

TAATCTTGCT TTGTCCAGCT TTTTCTAATT ATGTTATGTT TATTTTCTTT CTTTACTTAT 84 0 

AATTATTTCT CTGTTTGTCA TCTCTTTCAA ATTCCTCCTG TCTAGTAGAA AC C AT AAAAA 900 

CAAAAAATAA AAATGAAAAT AAAATTAAAA TAAAATAAAA TCAAAAAATG AAATAAAAAC 96 0 

AACAAAAAAT TAAAAAACGA AAAACCAAAA AGACCCGAAA GGGACCAATT TGGCCAAATT 102 0 

TGGGTTTTGT TTTTGTTTTT TGTTTTTTGT TTTTTATTTT TTATTTTATT TTTATTTTAT 108 0 

TTTATTTTTA TTTTATTTTT ATTTTATTTA TTTTTTGTTT TCGTTGTTTT TGTTATTTTA 114 0 

TTATTTATTA AGCACAACAC ACAGAAAGCA AACTTTAATT AAACACACTT ATTTAAAATT 12 00 

TAACACACTA AGCAAGCACA AGCAATAAAG ATAAAGAAAG CTTTATATAT TTATAGGCTT 12 6 0 

TTTTATAATT TAACTTACAG CTGCTTTCAA GCAAGTTCTG CGAGTTTTGC CTGCTTTTTA 132 0 

ACCCCGAACA TTTCATAGAA CTTGTTAAGA GTTTCACTGT AATGTTCCAT AGCAACACTC 13 80 

CCTTTAGCAT TAGGATTGCT GGAGCTAAGT ATAGCAGCAT ACTCTTTCCC CTTCTTCACC 144 0 
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TGATCTTCAT TCATTTCAAA TGCTTTGCTT TTCAGCACAG TGCAAACTTT TCCTAAGGCT 150 0 

TCCTTGGTGT CATACTTCTT TGGGTCGATC CCGAGGTCCT TGTATTTTGC ATC CTGATAT 1560 

ATAGCCAAGA CAACACTGAT CATCTCAAAG CTATCAACTG AAGCAATAAG AGGTAAGCTA 162 0 

CCTCCCAGCA TTATGGCAAG TCTCACAGAC TTTGCATCAT CGAGAGGTAA TCCATAGGCT 168 0 

TGAATCAAAG GATGGGAAGC AATCTTAGAT TTGATAGTAT TGAGATTCTC AGAATTCCCA 174 0 

GTTTCTTCAA CAAGCCTGAC CCTGATCAAG CTATCAAGCC TTCTGAAGGT CATGTCAGTG 18 0 0 

CCTCCAATCC TGTCTGAAGT TTTCTTTATG GTAATTTTAC CAAAAGTAAA ATCGCTTTGC 18 6 0 

TTAATAACCT TCATTATGCT CTGACGATTC TTTAGGAATG TGAGACATGA AATAACGCTC 192 0 

ATCTTCTTGA TCTGGTCGAT GTTTTCCAGA CAAAAAGTCT TGAAGTTGAA TGCTACCAGA 198 0 

TTCTGATCTT CCTCAAACTC AAGGTCTTTG CCTTGTGTCA ACAAAGCAAC AATGCTTTCC 2 04 0 

TTAGTGAGCT TAACCTTAGA CATGATGATC GTAAAAGTTG TTATATGCTT TGACCGTATG 210 0 

TAACTCAAGG TGCGAAAGTG CAACTCTGTA TCCCGCAGTC GTTTCTTAGG TTCTTAATGT 216 0 

GATGATTTGT AAGAC TGAGT GTTAAGGTAT GAACACAAAA TTGACACGAT TGCTCT 2216 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

AAATTCTCTT GCAGTGAAAT CTCTGCTCAT GTTAGCAGAA AACAACATCA TGCCTAACTC 6 0 

TCAAGCTTTT GTCAAAGCTT CTACTGATTC TAATTTCAAG CTGAGCCTCT GGCTAAGGGT 12 0 

TCCAAAGGTT TTGAAGCAGA TTTCCATTCA GAAATTGTTC AAGGTTGCAG GAGATGAAAC 18 0 

AAATAAAACA TTTTATTTAT CTATTGCCTG CATTCCAAAC CATAACAGTG TTGAGACAGC 240 

TTTAAACATT ACTGTTATTT GCAAGCATCA GCTCCCAATT CGTAAATGTA AAACTCCTTT 3 00 

TGAATTATCA ATGATGTTTT CTGATTTAAA GGAGCCTTAC AACATTATTC ATGATCCTTC 36 0 

ATATCCCCAA AGGATTGTTC ATGCTCTGCT TGAAACTCAC ACATCTTTTG CACAAGTTCT 42 0 
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TTGCAACAAC TTGCAAGAAG ATGTGATCAT CTACACCTTG AACAACCATG AGCTAACTCC 4 80 

TGGAAAGTTA GATTTAGGTG AAATAACTTT GAATTACAAT GAAGACGCCT ACAAAAGGAA 54 0 

ATATTTCCTT TCAAAAACAC TTGAATGTCT TCCATCTAAC ATACAAACTA TGTCTTATTT 600 

AGACAGCATC CAAATCCCTT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA TTAAAATTTC 660 

TCCACAATCT ATTTCAGTTG CAAAATCTTT GTTAAATCTT GATTTAAGCG GGATTAAAAA 72 0 

GAAAGAATCT AAGATTAAGG AAGCATATGC TTCAGGATCA AAATGATCTT GCTGTGTCCA 78 0 

GCTTTTTCTA ATTATGTTAT GTTTATTTTC TTTCTTTACT TATAATTATT TTTCTGTTTG 84 0 

TCATTTCTTT CAAATTCCTC CTGTCTAGTA GAAACCATAA AAACAAAAAT AAAAATAAAA 90 0 

TAAAATCAAA ATAAAATAAA AATCAAAAAA TGAAATAAAA GCAACAAAAA AATTAAAAAA 96 0 

CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGGCCA AATTTGGGGT TTGTTTTTGT 102 0 

TTTTTGTTTT TTTGTTTTTT GTTTTTATTT TTATTTTTAT TTTTATTTTT ATTTTATTTT 1080 

ATTTTATGTT TTTGTTGTTT TTGTTATTTT GTTATTTATT AAGCACAACA CACAGAAAGC 114 0 

AAACTTTAAT TAAACACACT TATTTAAAAT TTAACACACT AAGCAAGCAC AAACAATAAA 12 0 0 

GATAAAGAAA GCTTTATATA TTTATAGGCT TTTTTATAAT TTAACTTACA GCTGCTTTTA 12 6 0 

AGCAAGTTCT GTGAGTTTTG CCTGTTTTTT AACCCCAAAC ATTTCATAGA ACTTGTTAAG 132 0 

GGTTTCACTG TAATGTTCCA TAGCAATACT TCCTTTAGCA TTAGGATTGC TGGAGCTAAG 13 8 0 

TATAGCAGCA TACTCTTTCC CCTTCTTCAC CTGATCTTCA TTCATTTCAA ATGCTTTTCT 144 0 

TTTCAGCACA GTGCAAACTT TTCCTAAGGC TTCCCTGGTG TCATACTTCT TTGGGTCGAT 15 0 0 

CCCGAGATCC TTGTATTTTG CATCCTGATA TATAGCCAAG AC AAC AC TGA TCATCTCAAA 156 0 

GCTATCAACT GAAGCAATAA GAGGTAAGCT ACCTCCCAGC ATTATGGCAA GCCTCACAGA 162 0 

CTTTGCATCA TCAAGAGGTA AT C CATAGGC TTGAATCAAA GGGTGGGAAG CAATCTTAGA 16 8 0 

TTTGATAGTA TTGAGATTCT CAGAATTCC 1709 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



Gin Val Glu Ser Asn Asn Arg Thr Val Asn Ser Leu Ala Val Lys Ser 
1 5 10 15 

Leu Leu Met Ser Ala Glu Asn Asn lie Met Pro Asn Ser Gin Ala Ser 
20 25 30 

Thr Asp Ser His Phe Lys Leu Ser Leu Trp Leu Arg Val Pro Lys Val 
35 40 45 

Leu Lys Gin Val Ser lie Gin Lys Leu Phe Lys Val Ala Gly Asp Glu 
50 55 60 

Thr Asn Lys Thr Phe Tyr Leu Ser He Ala Cys He Pro Asn His Asn 
65 70 75 80 

Ser Val Glu Thr Ala Leu Asn He Thr Val He Cys Lys His Gin Leu 
85 90 95 

Pro He Arg Lys Cys Lys Ala Pro Phe Glu Leu Ser Met Met Phe Ser 
100 105 110 

Asp Leu Lys Glu Pro Tyr Asn He Val His Asp Pro Ser Tyr Pro Lys 
115 120 125 

Gly Ser Val Pro Met Leu Trp Leu Glu Thr His Thr Ser Leu His Lys 
130 135 140 

Phe Phe Ala Thr Asn Leu Gin Glu Asp Val He He Tyr Thr Leu Asn 
145 150 155 160 

Asn Leu Glu Leu Thr Pro Gly Lys Leu Asp Leu Gly Glu Arg Thr Leu 
165 170 175 

Asn Tyr Ser Glu Asp Ala Tyr Lys Arg Lys Tyr Phe Leu Ser Lys Thr 
180 185 190 

Leu Glu Cys Leu Pro Ser Asn Thr Gin Thr Met Ser Tyr Leu Asp Ser 
195 200 205 

He Gin He Pro Ser Trp Lys He Asp Phe Ala Arg Gly Glu He Lys 
210 215 220 

He Ser Pro Gin Ser He Ser Val Ala Lys Ser Leu Leu Lys Leu Asp 
225 230 235 240 

Leu Ser Gly He Lys Lys Lys Glu Ser Lys Val Lys Glu Ala Tyr Ala 
245 250 255 



Ser Gly Ser Lys 
260 



-7- 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

TTAAC AC AC T AAGCAAGCAC AAACAATAAA GATAAAGAAA GCTTTATATA TTTATAGGCT 6 0 

TTTTTATAAT TTAACTTACA GCTGCTTTTA AGCAAGTTCT GTGAGTTTTG CCTGTTTTTT 12 0 

AACCCCAAAC ATTTCATAGA ACTTGTTAAG GGTTTCACTG TAATGTTCCA TAGCAATACT 18 0 

TCCTTTAGCA TTAGGATTGC TGGAGCTAAG TATAGCAGCA TACTCTTTCC CCTTCTTCAC 24 0 

CTGATCTTCA TTCATTTCAA ATGCTTTTCT TTTCAGCACA GTGCAAACTT TTCCTAAGGC 3 00 

TTCCCTGGTG TCATACTTCT TTGGGTCGAT CCCGAGATCC TTGTATTTTG CATCCTGATA 360 

TATAGCCAAG ACAACACTGA TCATCTCAAA GCTATCAACT GAAGCAATAA GAGGTAAGCT 42 0 

ACCTCCCAGC ATTATGGCAA GCCTCACAGA CTTTGCATCA TCAAGAGGTA ATCCATAGGC 480 

TTGACTCAAA GGGTGGGAAG CAATCTTAGA TTTGATAGTA TTGAGATTCT CAGAATTCCC 54 0 

AGTTTCCTCA ACAAGCCTGA CCCTGATCAA GCTATCAAGC CTTCTGAAGG TCATGTCAGT 600 

GGCTCCAATC CTGTCTGAAG TTTTCTTTAT GGTAATTTTA CCAAAAGTAA AATCGCTTTG 66 0 

CTTAATAACC TTCATTATGC TCTGACGATT CTTCAGGAAT GTCAGACATG AAATAATGCT 72 0 

CATCTTTTTG ATCTGGTCAA GGTTTTCCAG ACAAAAAGTC TTGAAGTTGA AT G C T AC C AG 78 0 

ATTCTGATCT TCCTCAAACT CAAGGTCTTT GCCTTGTGTC AACAAAGCAA CAATGCTTTC 84 0 

CTTAGTGAGC TTAAC CAT oizo 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 02 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

AAATTCTCTT GCAGTGAAAT CTCTGCTCAT GTTAGCAGAA AACAACATCA TGCCTAACTC 6 0 

TCAAGCTTTT GTCAAAGCTT CTACTGATTC TAATTTCAAG CTGAGCCTCT GGCTAAGGGT 12 0 

TCCAAAGGTT TTGAAGCAGA TTTCCATTCA GAAATTGTTC AAGGTTGCAG GAGATGAAAC 180 

AAATAAAACA TTTTATTTAT CTATTGCCTG CATTCCAAAC CATAACAGTG TTGAGACAGC 24 0 

TTTAAACATT ACTGTTATTT GCAAGCATCA GCTCCCAATT CGTAAATGTA AAACTCCTTT 3 00 

TGAATTATCA ATGATGTTTT CTGATTTAAA GGAGCCTTAC AACATTATTC ATGATCCTTC 360 

AT AT C C C C AA AGGATTGTTC ATGCTCTGCT TGAAAC TCAC ACATCTTTTG CACAAGTTCT 42 0 

TTGCAACAAC TTGCAAGAAG ATGTGATCAT CTACACCTTG AACAACCATG AGCTAACTCC 48 0 

TGGAAAGTTA GATTTAGGTG AAATAACTTT GAATTACAAT GAAGACGCCT ACAAAAGGAA 54 0 

ATATTTCCTT TCAAAAACAC TTGAATGTCT TCCATCTAAC ATACAAACTA TGTCTTATTT 600 

AGACAGCATC CAAATCCCTT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA TTAAAATTTC 66 0 

TCCACAATCT ATTTCAGTTG CAAAATCTTT GTTAAATCTT GATTTAAGCG GGATTAAAAA 72 0 

GAAAGAATCT AAGATTAAGG AAGCATATGC TTCAGGATCA AAATGATCTT GCTGTGTCCA 78 0 

GCTTTTTCTA ATTATGTTAT GTTTATTTTC TTTCTTTACT TATAATTATT TTTCTGTTTG 84 0 

TCATTTCTTT CAAATTCCTC CTGTCTAGTA GAAACCATAA AAACAAAAAT AAAAATAAAA 900 

TAAAATCAAA ATAAAATAAA AATCAAAAAA TGAAATAAAA GCAACAAAAA AATTAAAAAA 96 0 

CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGGCCA AATTTGGGGT TTGTTTTTGT 102 0 

TTTTTGTTTT TTTGTTTTTT GTTTTTATTT TTATTTTTAT TTTTATTTTT ATTTTATTTT 108 0 

ATTTTATGTT TTTGTTGTTT TTGTTATTTT GTTATTTATT AAGCACAACA CACAGAAAGC 114 0 

AAACTTTAAT TAAACACACT TATTTAAAAT TTAACACACT AAGCAAGCAC AAACAATAAA 12 0 0 

GATAAAGAAA GCTTTATATA TTTATAGGCT TTTTTATAAT TTAACTTACA GCTGCTTTTA 12 6 0 

AGCAAGTTCT GTGAGTTTTG CCTGTTTTTT AACCCCAAAC ATTTCATAGA ACTTGTTAAG 132 0 

GGTTTCACTG TAATGTTCCA TAGCAATACT TCCTTTAGCA TTAGGATTGC TGGAGCTAAG 13 8 0 

TATAGCAGCA TACTCTTTCC CCTTCTTCAC CTGATCTTCA TTCATTTCAA ATGCTTTTCT 144 0 

TTTCAGCACA GTGCAAACTT TTCCTAAGGC TTCCCTGGTG TCATACTTCT TTGGGTCGAT 15 0 0 

CCCGAGATCC TTGTATTTTG CATCCTGATA TATAGCCAAG ACAACACTGA TCATCTCAAA 156 0 

GCTATCAACT GAAGCAATAA GAGGTAAGCT ACCTCCCAGC ATTATGGCAA GCCTCACAGA 162 0 
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CTTTGCATCA TCAAGAGGTA ATC CATAGGC TTGACTCAAA GGGTGGGAAG CAATCTTAGA 1680 

TTTGATAGTA TTGAGATTCT CAGAATTCGC AGTTTCCTCA ACAAGCCTGA CCCTGATCAA 174 0 

GCTATCAAGC CTTCTGAAGG TCATGTCAGT GGCTCCAATC CTGTCTGAAG TTTTCTTTAT 1800 

GGTAATTTTA CCAAAAGTAA AATCGCTTTG CTTAATAACC TTCATTATGC TCTGACGATT 186 0 

CTTCAGGAAT GTCAGACATG AAATAATGCT CATCTTTTTG ATCTGGTCAA GGTTTTCCAG 192 0 

ACAAAAAGTC TTGAAGTTGA ATGCTACCAG ATTCTGATCT TCCTCAAACT CAAGGTCTTT 198 0 

GCCTTGTGTC AACAAAGCAA CAATGCTTTC CTTAGTGAGC TTAACCAT 2 02 8 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10; 
TTCTGGTCTT CTTCAAACTC A 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTGTAGCCAT GAGCAAAG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



- 10- 



(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Ser Gly Val Tyr Glu Ser lie lie Gin Thr Lys Ala Ser Val 
15 10 15 

Trp Gly Ser Thr Ala Ser Gly Lys Ser lie Val Asp Ser Tyr Trp lie 
20 25 30 

Tyr Glu Phe Pro Thr Gly Ser Pro Leu Val Gin Thr Gin Leu Tyr Ser 
35 40 45 

Asp Ser Arg Ser Lys Ser Ser Phe Gly Tyr Thr Ser Lys lie Gly Asp 
50 55 60 

lie Pro Ala Val Glu Glu Glu lie Leu Ser Gin Asn Val His He Pro 
65 70 75 80 

Val Phe Asp Asp He Asp Phe Ser He Asn He Asn Asp Ser Phe Leu 
85 90 95 

Ala He Ser Val Cys Ser Asn Thr Val Asn Thr Asn Gly Val Lys His 
100 105 110 

Gin Gly His Leu Lys Val Leu Ser Leu Ala Gin Leu His Pro Phe Glu 
115 120 125 

Pro Val Met Ser Arg Ser Glu He Ala Ser Arg Phe Arg Leu Gin Glu 
130 135 140 

Glu Asp He He Pro Asp Asp Lys Tyr He Ser Ala Ala Asn Lys Gly 
145 150 155 160 

Ser Leu Ser Cys Val Lys Glu His Thr Tyr Lys Val Glu Met Ser His 
165 170 175 

Asn Gin Ala Leu Gly Lys Val Asn Val Leu Ser Pro Asn Arg Asn Val 
180 185 190 

His Glu Trp Leu Tyr Ser Phe Lys Pro Asn Phe Asn Gin He Glu Ser 
195 200 205 

Asn Asn Arg Thr Val Asn Ser Leu Ala Val Lys Ser Leu Leu Met Ala 
210 215 220 

Thr Glu Asn Asn He Met Pro Asn Ser Gin Ala Phe Val Lys Ala Ser 
225 230 235 240 

Thr Asp Ser His Phe Lys Leu Ser Leu Trp Leu Arg He Pro Lys Val 
245 250 255 



Leu Lys Gin He Ala He Gin Lys Leu Phe Lys Phe Ala Gly Asp Glu 
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260 265 270 

Thr Gly Lys Ser Phe Tyr Leu Ser lie Ala Cys lie Pro Asn His Asn 
275 280 285 

Ser Val Glu Thr Ala Leu Asn Val Thr Val lie Cys Arg His Gin Leu 
290 295 300 

Pro lie Pro Lys Ser Lys Ala Pro Phe Glu Leu Ser Met lie Phe Ser 
305 310 315 320 

Asp Leu Lys Glu Pro Tyr Asn Thr Val His Asp Pro Ser Tyr Pro Gin 
325 330 335 

Arg lie Val His Ala Leu Leu Glu Thr His Thr Ser Phe Ala Gin Val 
340 345 350 

Leu Cys Asn Lys Leu Gin Glu Asp Val lie He Tyr Thr He Asn Ser 
355 360 365 

Pro Glu Leu Thr Pro Ala Lys Leu Asp Leu Gly Glu Arg Thr Leu Asn 
370 375 380 

Tyr Ser Glu Asp Ala Ser Lys Lys Lys Tyr Phe Leu Ser Lys Thr Leu 
385 390 395 400 

Glu Cys Leu Pro Val Asn Val Gin Thr Met Ser Tyr Leu Asp Ser He 
405 410 415 

Gin He Pro Ser Trp Lys He Asp Phe Ala Arg Gly Glu He Arg He 
420 425 430 

Ser Pro Gin Ser Thr Pro He Ala Arg Ser Leu Leu Lys Leu Asp Leu 
435 440 445 

Ser Lys He Lys Glu Lys Lys Ser Leu Thr Trp Glu Thr Ser Ser Tyr 
450 455 460 

Asp Leu Glu 
465 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
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Met Ser Lys Val Lys Leu Thr Lys Glu Asn lie Val Ser Leu Leu Thr 
15 10 15 

Gin Ser Ala Asp Val Glu Phe Glu Glu Asp Gin Asn Gin Val Ala Phe 
20 25 30 

Asn Phe Lys Thr Phe Cys Gin Glu Asn Leu Asp Leu lie Lys Lys Met 
35 40 45 

Ser lie Thr Ser Cys Leu Thr Phe Leu Lys Asn Arg Gin Gly lie Met 
50 55 60 

Lys Val Val Asn Gin Ser Asp Phe Thr Phe Gly Lys Val Thr lie Lys 
65 70 75 80 

Lys Asn Ser Glu Arg Val Gly Ala Lys Asp Met Thr Phe Arg Arg Leu 
85 90 95 

Asp Ser Met lie Arg Val Lys Leu lie Glu Glu Thr Ala Asn Asn Glu 
100 105 110 

Asn Leu Ala He He Lys Ala Lys He Ala Ser His Pro Leu Val Gin 
115 120 125 

Ala Tyr Gly Leu Pro Leu Ala Asp Ala Lys Ser Val Arg Leu Ala He 
130 135 140 

Met Leu Gly Gly Ser He Pro Leu He Ala Ser Val Asp Ser Phe Glu 
145 150 155 160 

Met He Ser Val Val Leu Ala He Tyr Gin Asp Ala Lys Tyr Lys Glu 
165 170 175 

Leu Gly He Glu Pro Thr Lys Tyr Asn Thr Lys Glu Ala Leu Gly Lys 
180 185 190 

Val Cys Thr Val Leu Lys Ser Lys Gly Phe Thr Met Asp Asp Ala Gin 
195 200 205 

He Asn Lys Gly Lys Glu Tyr Ala Lys He Leu Ser Ser Cys Asn Pro 
210 215 220 

Asn Ala Lys Gly Ser He Ala Met Asp Tyr Tyr Ser Asp Asn Leu Asp 
225 230 235 240 

Lys Phe Tyr Glu Met Phe Gly Val Lys Lys Glu Ala Lys He Ala Gly 
245 250 255 

Val Ala 



INFORMATION FOR SEQ ID NO : 14 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 3049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

AGAGCAATTG GGTCATTTTT TATTCTAAAT CGAACCTCAA CTAGCAAATC TCAGAACTGT 6 0 

AATAAGCACA AGAGCACAAG AGCCACAATG TCATCAGGTG TTTATGAATC GATCATTCAG 12 0 

ACAAAGGCTT CAGTTTGGGG AT CGACAGCA TCTGGTAAGT CCATCGTGGA TTCTTACTGG 18 0 

ATTTATGAGT TTCCAACTGG TTCTCCACTG GTTCAAACTC AGTTGTACTC TGATTCGAGG 24 0 

AGCAAAAGTA GCTTCGGCTA CACTTCAAAA ATTGGTGATA TTCCTGCTGT AGAGGAGGAA 3 00 

ATTTTATCTC AGAACGTTCA TATCCCAGTG TTTGATGATA TTGATTTCAG CATCAATATC 3 60 

AATGATTCTT TCTTGGCAAT TTCTGTTTGT TC CAACAC AG TTAACAC C AA TGGAGTGAAG 42 0 

CATCAGGGTC ATCTTAAAGT TCTTTCTCTT GCCCAATTGC ATCCCTTTGA ACCTGTGATG 4 80 

AGCAGGTCAG AGATTGCTAG CAGATTCCGG CTCCAAGAAG AAGATATAAT TCCTGATGAC 54 0 

AAATATATAT CTGCTGCTAA CAAGGGATCT CTCTCCTGTG TCAAAGAACA TAC TTACAAA 600 

GTCGAAATGA GCCACAATCA GGCTTTAGGC AAAGTGAATG TTCTTTCTCC TAACAGAAAT 66 0 

GTTCATGAGT GGCTGTATAG TTTCAAACCA AATTTCAACC AGATCGAAAG TAATAACAGA 72 0 

ACTGTAAATT CTCTTGCAGT CAAATCTTTG CTCATGGCTA CAGAAAACAA CATTATGCCT 78 0 

AACTCTCAAG CTTTTGTTAA AGCTTCTACT GATTCTCATT TTAAGTTGAG CCTTTGGCTG 84 0 

AGAATTCCAA AAGTTTTGAA GCAAATAGCC ATACAGAAGC TCTTCAAGTT TGCAGGAGAC 90 0 

GAAACCGGTA AAAGTTTCTA TTTGTCTATT GCATGCATCC CAAATCACAA CAGTGTGGAA 96 0 

ACAGCTTTAA ATGTCACTGT TATATGTAGA CATCAGCTTC CAATCCCTAA GTCCAAAGCT 102 0 

CCTTTTGAAT TATCAATGAT TTTCTCCGAT CTGAAAGAGC CTTACAACAC TGTGCATGAT 108 0 

CCTTCATATC CTCAAAGGAT TGTTCATGCT TTGCTTGAGA CTCACACTTC CTTTGCACAA 114 0 

GTTCTCTGCA ACAAGCTGCA AGAAGATGTG AT CAT AT AT A CTATAAACAG CCCTGAACTA 12 0 0 

ACCCCAGCTA AGCTGGATCT AGGTGAAAGA ACCTTGAACT ACAGTGAAGA TGCTTCGAAG 126 0 

AAGAAGTATT TTCTTTCAAA AACACTCGAA TGCTTGCCAG TAAATGTGCA GACTATGTCT 13 2 0 

TATTTGGATA GCATCCAGAT TCCTTCATGG AAGATAGACT TTGCCAGAGG AGAGATCAGA 13 8 0 
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ATCTCCCCTC AATCTACTCC TATTGCAAGA TCTTTGCTCA AGCTGGATTT GAGCAAGATC 1440 

AAGGAAAAGA AGTCCTTGAC TTGGGAAACA TCCAGCTATG ATCTAGAATA AAAGTGGCTC 15 0 0 

ATACTACTCT AAGTAGTATT TGTCAACTTG CTTATCCTTT ATGTTGTTTA TTTCTTTTAA 156 0 

ATCTAAAGTA AGTTAGATTC AAGTAGTTTA GTATGCTATA GCATTATTAC AAAAAATACA 162 0 

AAAAAATACA AAAAAATACA AAAAATATAA AAAACCCAAA AAGATCCCAA AAGGGACGAT 168 0 

TTGGTTGATT TACTCTGTTT TAGGCTTATC TAAGCTGCTT TTGTTTGAGC AAAATAACAT 174 0 

TGTAACATGC AATAACTGGA ATTTAAAGTC CTAAAAGAAG TTTCAAAGGA CAGCTTAGCC 18 00 

AAAATTGGTT TTTGTTTTTG TTTTTTTGTT TTTTGTTTTT TTGTTTTATT TTTATTTTTA 186 0 

GTTTATTTTT TGTTTTTGTT ATTTTTATTT TTATTTTATT TTCTTTTATT TTATTTATAT 192 0 

AT AT AT C AAA CACAATCCAC ACAAATAATT TTAATTTCAA ACATTCTACT GATTTAACAC 198 0 

ACTTAGCCTG ACTTTATCAC ACTTAACACG CTTAGTTAGG CTTTAACACA CTGAACTGAA 2 04 0 

TTAAAACACA CTTAGTATTA TGCATCTCTT AATTAACACA CTTTAATAAT ATGCATCTCT 210 0 

GAATCAGCCT TAAAGAAGCT TTTATGCAAC ACCAGCAATC TTGGCCTCTT TCTTAACTCC 216 0 

AAACATTTCA TAGAATTTGT CAAGATTATC ACTGTAATAG TCCATAGCAA TGCTTCCCTT 22 2 0 

AGCATTGGGA TTGCAAGAAC TAAGTATCTT GGCATATTCT TTCCCTTTGT TTATCTGTGC 22 8 0 

AT CAT C C ATT GTAAATCCTT TGCTTTTAAG CACTGTGCAA ACCTTCCCCA GAGCTTCCTT 2340 

AGTGTTGTAC TTAGTTGGTT CAATCCCTAA CTCCTTGTAC TTTGCATCTT GATATATGGC 24 0 0 

AAGAACAACA CTGATCATCT CGAAGCTGTC AACAGAAGCA ATGAGAGGGA TACTACCTCC 246 0 

AAGCATTATA GCAAGTCTCA CAGATTTTGC ATCTGCCAGA GGCAGCCCGT AAGCTTGGAC 252 0 

CAAAGGGTGG GAGGCAATTT TTGCTTTGAT AATAGCAAGA TTCTCATTGT TTGCAGTCTC 258 0 

TTCTATGAGC TTCACTCTTA TCATGCTATC AAGCCTCCTG AAAGTCATAT CCTTAGCTCC 264 0 

AACTCTTTCA GAATTTTTCT TTATCGTGAC CTTAC CAAAA GTAAAATCAC TTTGGTTCAC 2 7 00 

AAC TTTCATA ATGCCTTGGC GATTCTTCAA GAAAGTCAAA CATGAAGTGA TACTCATTTT 2 76 0 

CTTAATCAGG TCAAGATTTT CCTGACAGAA AGTCTTAAAG TTGAATGCGA CCTGGTTCTG 2 82 0 

GTCTTCTTCA AACTCAACAT CTGCAGATTG AGTTAAAAGA GAGACAATGT TTTCTTTTGT 2 88 0 

GAGCTTGACC TTAGACATGG TGGCAGTTTA GAT CTAGAC C TTTCTCGAGA GATAAGATT C 2 94 0 

AAGGTGAGAA AGTGCAACAC TGTAGACCGC GGTCGTTACT TATCCTGTTA ATGTGATGAT 3 000 
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TTGTATTGCT GAGTATTAGG TTTTTGAATA AAATTGACAC AATTGCTCT 3 049 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

TTATGCAACA CCAGCAATCT TGGCCTCTTT CTTAACTCCA AACATTTCAT AGAATTTGTC 6 0 

AAGATTATCA CTGTAATAGT C CATAGC AAT GCTTCCCTTA GCATTGGGAT TGCAAGAACT 12 0 

AAGTATCTTG GCATATTCTT TCCCTTTGTT TATCTGTGCA TCATCCATTG TAAATCCTTT 18 0 

GCTTTTAAGC ACTGTGCAAA CCTTCCCCAG AGCTTCCTTA GTGTTGTACT TAGTTGGTTC 24 0 

AATCCCTAAC TCCTTGTACT TTGCATCTTG ATATATGGCA AGAACAACAC TGATCATCTC 300 

G AAGCTGT C A ACAGAAGCAA TGAGAGGGAT ACTACCTCCA AGCATTATAG CAAGTCTCAC 36 0 

AGATTTTGCA TCTGCCAGAG GCAGCCCGTA AGCTTGGACC AAAGGGTGGG AGGCAATTTT 42 0 

TGCTTTGATA ATAGCAAGAT TCTCATTGTT TGCAGTCTCT TCTATGAGCT TCACTCTTAT 480 

CATGCTATCA AGCCTCCTGA AAGTCATATC CTTAGCTCCA ACTCTTTCAG AATTTTTCTT 54 0 

TATCGTGACC TTACCAAAAG TAAAATCACT TTGGTTCACA ACTTTCATAA TGCCTTGGCG 60 0 

ATTCTTCAAG AAAGTCAAAC ATGAAGTGAT ACTCATTTTC TTAATCAGGT CAAGATTTTC 66 0 

CTGACAGAAA GTCTTAAAGT TGAATGCGAC CTGGTTCTGG TCTTCTTCAA ACTCAACATC 72 0 

TGCAGATTGA GTTAAAAGAG AGACAATGTT TTCTTTTGTG AGCTTGACCT TAGACAT 7 77 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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18 



GTTCTGAGAT TTGCTAGT 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
TTATATCTTC TTCTTGGA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



ii) MOLECULE TYPE: DNA (genomic) 



(Ki) SEQUENCE DESCRIPTION: SEQ ID NO : 1 8 : 
ATGT CAT CAG GTGTTTATGA ATCGATCATT CAGACAAAGG CTTCAGTTTG GGGATCGACA 
GCATCTGGTA AGTCCATCGT GGATTCTTAC TGGATTTATG AGTTTCCAAC TGGTTCTCCA 
CTGGTTCAAA CTCAGTTGTA CTCTGATTCG AGGAGCAAAA GTAGCTTCGG CTACACTTCA 
AAAATTGGTG ATATTCCTGC TGTAGAGGAG GAAATTTTAT CTCAGAACGT T CAT AT C CC A 
GTGTTTGATG ATATTGATTT CAGCATCAAT ATCAATGATT CTTTCTTGGC AATTTCTGTT 
TGTTCCAACA CAGTTAACAC CAATGGAGTG AAGCATCAGG GTCATCTTAA AGTTCTTTCT 
CTTGCCCAAT TGCATCCCTT TGAACCTGTG ATGAGCAGGT CAGAGATTGC TAGCAGATTC 
CGGCTCCAAG AAGAAGATAT AATTCCTGAT GACAAATATA TATCTGCTGC TAACAAGGGA 
TCTCTCTCCT GTGT C AAAG A ACATACTTAC AAAGTCGAAA TGAGCCACAA TCAGGCTTTA 
GGCAAAGTGA ATGTTCTTTC TCCTAACAGA AATGTTCATG AGTGGCTGTA TAGTTTCAAA 
CCAAATTTCA ACCAGATCGA AAGTAATAAC AGAACTGTAA ATTCTCTTGC AGTCAAATCT 



18 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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TTGCTCATGG CTACAGAAAA CAACATTATG CCTAACTCTC AAGCTTTTGT TAAAGCTTCT 72 0 

ACTGATTCTC ATTTTAAGTT GAGCCTTTGG CTGAGAATTC CAAAAGTTTT GAAGCAAATA 780 

GC CATACAGA AGCTCTTCAA GTTTGCAGGA GACGAAACCG GTAAAAGTTT CTATTTGTCT 84 0 

ATTGCATGCA TCCCAAATCA CAACAGTGTG GAAACAGCTT TAAATGTCAC TGTTATATGT 90 0 

AGACATCAGC TTCCAATCCC TAAGTC CAAA GCTCCTTTTG AATTATCAAT GATTTTCTCC 96 0 

GATCTGAAAG AGCCTTACAA CACTGTGCAT GATCCTTCAT ATCCTCAAAG GATTGTTCAT 102 0 

GCTTTGCTTG AGACTCACAC TTCCTTTGCA CAAGTTCTCT GCAACAAGCT GCAAGAAGAT 10 8 0 

GTGATCATAT ATACTATAAA CAGCCCTGAA CTAACCCCAG CTAAGCTGGA TCTAGGTGAA 114 0 

AGAACCTTGA ACTACAGTGA AGATGCTTCG AAGAAGAAGT ATTTTCTTTC AAAAACACTC 12 0 0 

GAATGCTTGC CAGTAAATGT GCAGAC TATG TCTTATTTGG ATAGCATCCA GATTCCTTCA 126 0 

TGGAAGATAG ACTTTGCCAG AGGAGAGATC AGAATCTCCC CTCAATCTAC TCCTATTGCA 132 0 

AGATCTTTGC TCAAGCTGGA TTTGAGCAAG ATCAAGGAAA AGAAGTCCTT GACTTGGGAA 13 80 

ACATCCAGCT ATGATCTAGA A 14 01 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATGTCTAAGG TCAAGCTCAC AAAAGAAAAC ATTGTCTCTC TTTTAACTCA ATCTGCAGAT 60 

GTTGAGTTTG AAGAAGACCA GAAC CAGGTC GCATTCAACT TTAAGACTTT CTGTCAGGAA 12 0 

AATCTTGACC TGATTAAGAA AATGAGTATC ACTTCATGTT TGACTTTCTT GAAGAATCGC 18 0 

CAAGGCATTA TGAAAGTTGT GAACCAAAGT GATTTTACTT TTGGTAAGGT CACGATAAAG 24 0 

AAAAATTCTG AAAGAGTTGG AGCTAAGGAT ATGACTTTCA GGAGGCTTGA TAGCATGATA 300 

AGAGTGAAGC TCATAGAAGA GACTGCAAAC AATGAGAATC TTGCTATTAT CAAAGCAAAA 36 0 

ATTGCCTCCC ACCCTTTGGT CCAAGCTTAC GGGCTGCCTC TGGCAGATGC AAAATCTGTG 42 0 

AGACTTGCTA TAATGCTTGG AGGTAGTATC CCTCTCATTG CTTCTGTTGA CAGCTTCGAG 480 
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ATGATCAGTG TTGTTCTTGC CATATATCAA GATGCAAAGT ACAAGGAGTT AGGGATTGAA 



540 



C CAACTAAGT ACAACACTAA GGAAGCTCTG GGGAAGGTTT GCACAGTGCT TAAAAGCAAA 



600 



GGATTTACAA TGGATGATGC ACAGATAAAC AAAGGGAAAG AATATGCCAA GATAC TTAGT 



660 



TCTTGCAATC CCAATGCTAA GGGAAGCATT GCTATGGACT ATTACAGTGA TAATCTTGAC 



720 



AAATTCTATG AAATGTTTGG AGTTAAGAAA GAGGCCAAGA TTGCTGGTGT TGCATAA 



777 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TACTTATCTA GAACCATGGA CAAAGCAAAG ATTACCAAGG 4 0 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TACAGTGGAT CCATGGTTAT TTCAAATAAT TTATAAAAGC AC 42 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
AGCATTGGAT CCATGGTTAA CACACTAAGC AAGCAC 36 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
AGCTAAT C TA GAACCATGGA TGACTCACTA AGGAAAGCAT TGTTGC 46 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CCCACTATCC TTCGCAAGAC CC 22 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TACAGTGGAT CCATGGTTAA GGTAATCCAT AGGCTTGAC 3 9 



(2) INFORMATION FOR SEQ ID NO: 26: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
AGC TAAC CAT GGTTAAGCTC ACTAAGGAAA GCATTGTTGC 



(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
AGCTAATCTA GAACCATGGA TGACT CACTA AGGAAAGCAT TGTTGC 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
AGCATTGGAT CCATGGTTAA CACACTAAGC AAGCAC 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TACAGTTCTA GAACCATGGA TGATGCAAAG TCTGTGAGG 3 9 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AGATTCTCTA GACCATGGTG ACTTGATGAG CAAAGTCTGT GAGGCTTGC 



49 



